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35.45 SwissProt 35) evidenced some degree of homology between the PR0718 amino acid sequence and the 
following Dayhoff sequences: AF045606J, AF039906J, SPBC8D22, S63441, F64728, COXITRYBB, 
F64375, E64173, RPYGJT3, MTCY261_23. 

EXAMPLE 26 : Isolation of cDNA clones Encoding Human PRQ872 
5 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

Incyte EST sequence designated herein as clul 20709. init. The clu 120709. init sequence was then compared a 

proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 

homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 

et aL, Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
10 in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 

DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 

consensus sequence obtained therefrom is herein designated DNA48254. 

In light of an observed sequence homology between the DNA48254 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3438068, the Incyte EST clone 3438068 was purchased 
15 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

The sequence of this cDNA insert is shown in Figure 63 and is the full-length DNA sequence for PR0872. 

Clone DNA49819-1439 was deposited with the ATCC on June 2, 1998, and is assigned ATCC deposit no. 

209931. 

The entire nucleotide sequence of DNA49819-1439 is shown in Figure 63 (SEQ ID NO: 1 12). Clone 
20 DNA49819-1439 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 14-16 and ending at the stop codon at nucleotide positions 1844-1846 (Figure 63). The predicted 
polypeptide precursor is 610 amino acids long (Figure 64). The full-length PR0872 protein shown in Figure 
64 has an estimated molecular weight of about 66,820 daltons and a pi of about 8.65. Analysis of the full-length 
PR0872 sequence shown in Figure 64 (SEQ ID NO: 113) evidences the presence of the following features: a 
25 signal peptide at amino acid 1 to about 18, putative transmembrane domains at about amino acids 70-87, 200-222 
and 568-588; sequence identity with bacterial -type phytoene dehydrogenase protein at about amino acids 71-105; 
sequence identity with a regulator of chromosome condensation (RCC1) signature 2 at about amino acids 201- 
211; leucine zipper patterns at about amino acids 214-235, 221-242, 228-249 and 364-385; a potential N- 
glycosylation site at about amino acids 271-274; and a glycosaminoglycan attachment site at about amino acids 
30 75-78. Analysis of the amino acid sequence of the full-length PR0872 polypeptide using the Dayhoff database 
(version 35.45 SwissProt 35) evidenced homology between the PR0872 amino acid sequence and the following 
Dayhoff sequences: PRCRTI J y S75951, S74689, CELF37C4 3, CRTIRHOCA, S76617, YNI2J4ETTL, 
MTV014 14, AOFB HUMAN, and MM U 70429 1. 

35 EXAMPLE 27 : Isolation of cDNA clones Encodine Human PRO 1063 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence designated herein as ss. clul 19743. init. The Incyte EST cluster sequence 
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ss. dull 9743. init sequence was then compared to a variety of expressed sequence tag (EST) databases which 
included public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte 
Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using 
the computer program BLAST or BLAST2 (Altshul etaL, Methods in Enzvmology 266:460-480 (1996)). Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 
5 proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, 
University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein 
: designated DNA48288. 

^ In light of an observed sequence homology between the DNA48288 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 2783726, the Incyte EST clone 2783726 was purchased 
10 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 65 and is herein designated DNA49820-1427. 

The full length clone shown in Figure 65 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 90-92 and ending at the stop codon found at nucleotide positions 
993-995 (Figure 65; SEQ ID NO.114). The predicted polypeptide precursor is 301 amino acids long, has a 
15 calculated molecular weight of approximately 33,530 dahons and an estimated pi of approximately 4,80. 
Analysis of the full-length PRO 1063 sequence shown in Figure 66 (SEQ ID NO: 115) evidences the presence of 
jy the following: a signal peptide from about amino acid 1 to about amino acid 21, potential N-giycosylation sites 

^> from about amino acid 195 to about amino acid 198, from about amino acid 217 to about amino acid 220 and 

from about amino acid 272 to about amino acid 275, a glycosaminoglycan attachment site from about amino acid 
20 267 to about amino acid 270, a microbodies C-terminal targeting signal site from about amino acid 299 to about 
amino acid 301, a type II fibronectin collagen-binding domain homology sequence from about amino acid 127 
to about amino acid 168 and a fructose -bisphosphate aldolase class II protein homology sequence from about 
amino acid 101 to about amino acid 118. Clone DNA49820-1427 has been deposited with the ATCC on June 
2, 1998 and is a signed ATCC deposit no. 209932. 
25 Analysis of the amino acid sequence of the full-length PRO 1063 polypeptide suggests that it possesses 

sequence similarity to the human type IV collagenase protein. More specifically, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced some degree of homology between the PRO 1063 amino acid 
sequence and the following Dayhoff sequences, S68303, CFU68533 1, PJ>91139, RNU65656J, 
Pf PA2R JRABIT . MMU56734J, FINCXENLA, A48925, P_R92778 and FAI2_HUMAN. 

30 

EXAMPLE 28 : Isolation of cDNA clones Encoding Human PRQ619 
- Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte database, designated herein as 88434. This EST cluster sequence was then 
compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., 
35 GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify 
existing homologies. The homology search was performed using the computer program BLAST or BLAST2 
^ (Altshul et ah, Methods in Enzvmoiogy 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
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of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Incyte EST clone no. 1656694, the Incyte EST clone 1656694 was purchased and the 
5 cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 67 and is herein designated as DNA4982 1-1562. 

The full length clone shown in Figure 67 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 81-83 and ending at the stop codon found at nucleotide positions 
450-452 (Figure 67; SEQ ID NO: 116). The predicted polypeptide precursor (Figure 68, SEQ ID NO: 117) is 
10 123 amino acids long including a predicted signal peptide at about amino acids 1-20. PR0619 has a calculated 
molecular weight of approximately 13,710 daltons and an estimated pi of approximately 5.19. Clone 
DNA4982M562 was deposited with the ATCC on June 16, 1998 and is assigned ATCC deposit no. 209981. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 68 (SEQ ID NO: 117), revealed significant 
15 homology between the PR0619 amino acid sequence and the following Dayhoff sequences: S35302, D87009_l , 
HSU93494J, HUMIGLAM5 1, D86999_2, HUMIGLYM HUMIGLYMKEJ , A29491J, A29498J, 
and VPR2JvIOUSE, 

EXAMPLE 29 : Isolation of cDNA clones Encoding Human PRQ943 

20 A consensus DNA sequence encoding PR0943 was assembled relative to other EST sequences using 

phrap as described in Example 1 above. This consensus sequence was then extended using repeated cycles of 
BLAST and phrap to extend the consensus sequence as far as possible using the sources of EST sequences 
discussed above. The extended consensus sequence is herein designated DNA36360. Based on the DNA36360 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 

25 the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR0943. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5*-CGAGATGACGCCGAGCCCCC-3 * (SEQ ID NO: 120) 
reverse PCR primer 5'-CGGTTCGACACGcGGCAGGTG-3' (SEQ ID NO: 121) 
30 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA36360 
sequence which had the following nucleotide sequence 
hybridization probe 

5 ' -TGCTGCTCCTGCTGCCGCCGCTGCTGCTGGGGGCCTTCCCGCCGG -3 ' (SEQ ID NO: 122) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
35 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0943 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal brain tissue. 
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DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0943 (designated herein as DN A52192- 1369 [Figure 69, SEQ ID NO: 1 18]) and the derived protein sequence 
for PR0943. 

The entire nucleotide sequence of DNA52192-1369 is shown in Figure 69 (SEQ ID NO;118). Clone 
DNA52 192- 1369 contains a single open reading frame with an apparent translational initiation site at nucleotide 
5 positions 150-152 and ending at the stop codon at nucleotide positions 1662-1664 (Figure 69), The predicted 
polypeptide precursor is 504 amino acids long (Figure 70), The full-length PR0943 protein shown in Figure 
70 has an estimated molecular weight of about 54,537 daltons and a pi of about 10.04. Analysis of the full- 
length PR0943 sequence shown in Figure 70 (SEQ ID NO: 1 19) evidences the presence of the following: a signal 
peptide from about amino acid 1 to about amino acid 17, a transmembrane domain from about amino acid 376 

10 to about amino acid 396, tyrosine kinase phosphorylation sites from about amino acid 212 to about amino acid 
219 and from about amino acid 329 to about amino acid 336, potential N-glycosylation sites from about amino 
acid 1 1 1 to about amino acid 1 14, from about amino acid 231 to about amino acid 234, from about amino acid 
255 to about amino acid 258 and from about amino acid 293 to about amino acid 296 and an immunoglobulin 
and MHC protein sequence homology block from about amino acid 219 to about amino acid 236. Clone 

15 DNA52192-1369 has been deposited with ATCC on July 1, 1998 and is assigned ATCC deposit no. 203042. 

An analysis of the Dayhoff database (version 35.45 Swiss Prot 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 70 (SEQ ID NO: 119), evidenced significant 
homology between the PR0943 amino acid sequence and the following Dayhoff sequences: B49151, A39752, 
FGR1JCENLA, S38579, RATHBFGFRBl , TVHU2F, FGR2 MOUSE, CEK3 CHICK, PJR21080 and 

20 A27171J. 

EXAMPLE 30 : Isolation of cDNA clones Encoding Human PRO U 88 

A consensus DNA sequence was assembled relative to other EST sequences using the program "phrap" 
as described in Example 1 above. This consensus sequence is designated herein as DNA45679. Based on the 
25 DNA45679 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that 
contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence 
for PROH88. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5'-CTGGTGCCTCAACAGGOAGCAOr (SEQ ID NO: 125) 
30 reverse PCR primer 5'-CCATTGTGCAGGTCAGGTCACAG-3 ' (SEQ ID NO: 126) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA45679 sequence which had the following nucleotide sequence: 
hybridization probe 

5 ' -CTGGAGC AAGTGCTC AGCTGCCTGTGGTC AG ACTGGGGTC-3 ' (SEQ ID NO: 127) 
35 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROl 188 gene using the probe oligonucleotide and one of the PCR primers. RNA 
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for construction of the cDNA libraries was isolated from human fetal kidney tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR01188 (designated herein as DNA52598-1518 [Figure 71, SEQ ID NO: 123]); and the derived protein 
sequence for PRO 1188. 

The entire coding sequence of PRO 1 188 is shown in Figure 7 1 (SEQ ID NO: 123). Clone DN A52598- 
5 1518 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 
136-138 and an apparent stop codon at nucleotide positions 3688-3690. The predicted polypeptide precursor 
is 1184 amino acids long. The full-length PRO 1188 protein shown in Figure 72 has an estimated molecular 
weight of about 132,582 Daltons and a pi of about 8.80. Additional features include: a signal peptide at about 
amino acids 1-31; an ATP/GTP binding site motif A (P-Ioop) at about amino acids 266-273; an aldehyde 
10 dehydrogenases cysteine active site at about amino acids 188-199; growth factor and cytokines receptors family 
signature 2 at about amino acids 153-159; and potential N-glycosylation sites at about amino acids 129-132, 132- 
135, 346-349, 420-423, 550-553, 631-634, 1000-1003, and 1056-1059. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 72 (SEQ ID NO: 124), revealed significant 
15 homology between the PR01188 amino acid sequence and the following Dayhoff sequences; SSU83114_1, 
S56015, CET21B6_4, CELT19D2J, and TSPl_MOUSE. 

Clone DNA52598-1518 has been deposited with ATCC and is assigned ATCC deposit no 203107. 

EXAMPLE 31 : Isolation of cDNA clones Encoding Human PRO 1 133 

20 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 

in Example 1 above. This sequence was extended using repeated cycles of phrap. The extended consensus 
sequence is designated herein DNA38102. Based on the DNA38102 consensus sequence, oligonucleotides were 
synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as 
probes to isolate a clone of the full-length coding sequence for PROl 133. 

25 PCR primers (two forward and one reverse) were synthesized: 

forward PCR primer 1 5'-TCGATTATGGACGAACATGGCAGC-3* (SEQ ID NO: 130); 
forward PCR primer 2 5'-TTCTGAGATCCCTCATCCTC-3' (SEQ ID NO:131); and 
reverse primer 5 ' -AGGTTC AGGG AC AGC AAGTTTGGG-3 ' (SEQ ID NO:132). 

Addi'^nally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 

30 DNA38102 sequence which had the following nucleotide sequence: 
hybridization probe 

5 , TTTGCTGGACCTCGGCTACGGAATTGGCTTCCCTCTACGGACAGCTGGAT3 , (SEQ ID NO: 133). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with a PCR primer pair identified above. A positive library was then used to 
35 isolate clones encoding the PROl 133 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney tissue. 
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DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PROl 133 and the derived protein sequence for PRO 1 133. 

The entire coding sequence of PROl 133 is shown in Figure 73 (SEQ ID NO: 128). Clone DNA53913- 
1490 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 
266-268 and an apparent stop codon at nucleotide positions 1580-1582 of SEQ ID NO: 128. The predicted 
5 polypeptide precursor is 438 amino acids long. The signal peptide is at amino acids 1-18 of SEQ ID NO: 129. 
EGF-like domain cysteine pattern signatures start at 315 and 385 of SEQ ID NO: 129 as shown in Figure 74. 
Clone DNA53913-1490 has been deposited with ATCC and is assigned ATCC deposit no. 203162. The full- 
length PROl 133 protein shown in Figure 74 has an estimated molecular weight of about 49,260 daltons and a 
pi of about 6.15. 

10 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 

alignment analysis of the full-length sequence shown in Figure 74 (SEQ ID NO: 129), revealed some sequence 
identity between the PRO U 33 amino acid sequence and the following Dayhoff sequences (data from the database 
incorporated herein): AF002717J, LMG1 HUMAN, B54665, UNC6_CAEEL, LML1JTAEEL, 
LMA5J40USE, MMU88353J, LMA1_HUMAN, HSLN2C64 1 and AF005258J. 

15 

EXAMPLE 32 : Isolation of cDNA clones Encoding Human PRQ784 

An initial DNA sequence (SEQ ID NO: 136), referred to herein as DNA44661 and shown in Figure 77, 

was identified using a yeast screen, in a human fetal lung cDNA library that preferentially represents the 5 f ends 

of the primary cDNA clones, DNA44661 was then compared to ESTs from public databases (e,g., GenBank), 

20 and a proprietary EST database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA), using the computer 

program BLAST or BLAST2 [Altschul et al., Methods in Enzvmology. 266:460-480 (1996)]. The ESTs were 

then clustered and assembled into a consensus DNA sequence using the computer program "phrap" (Phil Green, 

University of Washington, Seattle, Washington). The consensus sequence obtained is designated herein as 

"DNA45463". Based on the DNA45463 consensus sequence, oligonucleotides were synthesized for use as 

25 probes to isolate a clone of the full-length coding sequence for PR0784 from a human fetal lung cDNA library. 

v 

The full length DNA53978-1443 clone shown in Figure 75 contained a single open reaoing frame with 
an apparent translational initiation site at nucleotide positions 37-39 and ending at the stop codon found at 
nucleotide positions 821-823 (Figure 75; SEQ ID NO: 134). The predicted polypeptide precursor (Figure 76, 
SEQ ID NO; 135) is 2?° -rniino acids long. PR0784 has a calculated molecular weight of approximately 25,735 

30 Daltons and an estimated pi of approximately 5.45. PR0784 has the following features: a signal peptide at about 
amino acid 1 to about 15; transmembrane domains at about amino acids 68 to about 87 and at about 183 to about 
204; potential N-myristoylation sites at about amino acids 15-20, 51-56, 66-60, 163-168, and 206-211; and an 
RNP-1 protein RNA-binding region at about amino acids 108 to about 117. 

Clone DNA53978-1443 was deposited with ATCC on June 16, 1998, and is assigned ATCC deposit 

35 no, 209983. 

Based on a BLAST and FastA sequence alignment analysis (using the ALIGN computer program) of 
the full-length sequence, PR0784 shows amino acid sequence identity to the following proteins: RNU42209 1, 
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MMU91538_1, CGU91742J, CELF55A4 6, SC22JTEAST, and F48188. 

EXAMPLE 33 : Isolation of cDN A Clones Encoding Human PRQ783 

A yeast screening assay was employed to identify cDNA clones that encoded potential secreted proteins. 
Use of this yeast screening assay allowed identification of a single cDNA clone, designated herein as DNA45201 
5 (Figure 80; SEQ ID NO: 1 39) . 

The DNA45201 sequence was then used to search expressed sequence tag (EST) databases for the 
presence of potential homologies. The EST databases included public EST databases (e.g., GenBank) and a 
proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA). The search was 
performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460- 

10 480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not 
encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" 
(Phil Green, Univ. of Washington, Seattle, Washington). The consensus sequence obtained is herein designated 
as "consenOl". A proprietary Genentech EST sequence was used in the consensus assembly and is herein 
designated as DNA14575 (Figure 81; SEQ ID NO: 140). 

15 Based on the consenOl sequence , oligonucleotides were synthesized: 1) to identify by PCR a cDNA 

library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding 
sequence for PR0783- In order to screen several libraries for a full-length clone, DNA from the libraries was 
screened by PCR amplification, as per Ausubel et al., Current Protocols in Molecular Biology, with the PCR 
primer pair. A positive library was then used to isolate clones encoding the gene of interest using the probe 

20 oligonucleotide and one of the primer pairs. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5'-GACTGTATCTGAGCCCCAGACTGC-3' (SEQ ID NO: 141), 
forward PCR primer 5 ' -TC AGC A ATG AGGTGCTGCTC -3 ' (SEQ ID NO: 142), and 
reverse PCR pnmer 5'-TGAGGAAGATGAGGGACAGGTTGG-3 ' (SEQ ID NO: 143). 

25 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consenOl 

sequence which had the following nucleotide sequence: 
hybridization probe 

5 ' -TATGGA AGC ACCTGACTACGAAGTGCTATCCGTGCG AG AAC AGCTATTCC-3 ' (SEQ ID NO: 144). 
In ordu to screen several libraries for a source of a full-length clone, DNA from the libraries was 

30 screened by PCR amplification with a PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0783 gene using the probe oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA libraries was isolated from human fetal kidney tissue (LIB228). 
The cDNA libraries used to isolate the cDNA clones were constructed by standard methods using commercially 
available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT 

35 containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately 
by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or 
pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al., Science , 
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253 : 1 278- 1 280 ( 1 99 1 )) in the unique Xhol and NotI sites . 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0783 [herein designated as DNA53996-1442] (SEQ ID NO: 137) and the derived protein sequence for 
PR0783. 

The entire nucleotide sequence of DNA53996-1442 is shown in Figure 78 (SEQ ID NO: 137). Clone 
5 DN A53996- 1442 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 310-312 and ending at the stop codon at nucleotide positions 1777-1779 (Figure 78). The predicted 
polypeptide precursor is 489 amino acids long (Figure 79). The full-length PR0783 protein shown in Figure 
79 has an estimated molecular weight of about 55,219 daltons and a pi of about 8.47. Analysis of the full-length 
PR0783 sequence shown in Figure 79 (SEQ ID NO: 138) evidences the presence of the following features: 
10 transmembrane domains located at about amino acids 23-42. 67-89, 111-135, 154-176, 194-218, 296-319, 
348-370, 387-410 and 427-452; leucine zipper patterns located at about amino acids 263-283 and 399-420; a 
potential tyrosine kinase phosphorylation site at about amino acids 180-187; potential N-glycosylation sites at 
about amino acids 105 -108 and 121-124; potential cAMP- and a cGMP-dependent protein kinase 
phosphorylation site at about ammo acids 288-291; and a region having sequence identity with bacterial 
15 rhodopsins retinal binding site protein at about amino acids 190-218. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35) shows some sequence identity 
between the PR0783 amino acid sequence and the following Dayhoff sequences: YNC2CAEEL, D64048, 
ATACO02332JF4P9,3, NY2RSHEEP, and VSH_MUMPA. 

Clone DNA53996-1442 was deposited with the ATCC on June 2, 1998, and is assigned ATCC deposit 
20 no. 209921. 

EXAMPLE 34 : Isolation of cDNA Clones Encoding Human PRO820 

An expressed sequence tag (EST) DNA database (Merck/ Wash. U) was searched and an EST designated 
EST no. AA50408O, Merck clone 825 136, was identified (library 312, human B-cell tonsil). Homology searches 

25 revealed that this EST showed sequence identity with low affinity immunoglobulin gamma Fc receptor II. DNA 
sequencing gave the full-length DNA sequence for PRO820 and the derived protein sequence for PRO820. 

The entire nucleotide sequence of DNA56041-1416 is shown in Figure 82 (SLQ ID NO: 145). Clone 
DNA56041-1416 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 115-117 and ending at the stop codon at nucleotide positions 487-489 (Figure 82). The predicted 

30 polypeptide precursor is 124 amino acids long (Figure 83). The full-length PRO820 protein shown in Figure 
83 has an estimated molecular weight of about 14,080 daltons and a pi of about 7.48. Clone DNA56041-1416 
has been deposited with ATCC. Regarding the sequence, it is understood that the deposited clone contains the 
correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Still analyzing the amino acid sequence of SEQ ID NO: 146, the putative signal peptide is at about amino 

35 acids 1-15 of SEQ ID NO: 146. Protein kinase C phosphorylation sites are at about amino acids 20-22 and 43-45 
of SEQ ID NO:146. An N-myristoylation site is at about amino acids 89-94 of SEQ ID NO: 146. An 
immunoglobulin and major histocompatibility complex domain is at about amino acids 83-90 of SEQ ID NO: 146. 
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The corresponding nucleotides can be routinely determined given the sequences provided herein. 

EXAMPLE 35 : Isolation of cDNA Clones Encoding Human PRO108Q 

A consensus DNA sequence was assembled relative to other EST sequences using phrap and was 
extended using repeated cycles of BLAST and phrap so as to extend the consensus sequence as far as possible 
^ 5 using the sources of the EST sequences as described in Example 1 above. The consensus sequence is designated 

herein as DNA52640. An EST proprietary to Genentech was employed in the consensus assembly and is herein 
| designated as DNA36527 (Figure 86; SEQ ID NO: 149). 

In light of an observed sequence homology between the DNA36527 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 526423, the Merck EST clone 526423 was purchased 
10 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 84 and is herein designated as DNA56047-1456. 

The entire nucleotide sequence of DNA56047-1456 is shown in Figure 84 (SEQ ID NO: 147). Clone 
DNA56047-1456 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 159-161 and ending at the stop codon at nucleotide positions 1233-1235 of SEQ ID NO: 147 (Figure 
15 84). The predicted polypeptide precursor is 358 amino acids long (Figure 85). The full-length PRO 1080 protein 
shown in Figure 85 has an estimated molecular weight of about 40,5 14 daltons and a pi of about 6.08. Clone 
DNA56047-1456 has been deposited with ATCC on June 9, 1998. It is understood that the deposited clone has 
the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

20 Also shown in Figure 85 are the approximate locations of the signal peptide, cell attachment site, Nt- 

DnaJ domain signature, region having sequence identity with Nt-DnaJ domain proteins, and N-glycosylation 
sites. The corresponding nucleic acids of these amino acid sequences and others provided herein can be 
routinely determined by the information provided herein. 

25 EXAMPLE 36 : Isolation of cDNA Clones Encoding Human PRO 1079 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above, and is herein designated DNA52714. Based on information provided by the assembly, the 
clone for Merck EST no. H06898 was obtained and sequenced, thereby giving the nucleotide sequence 
designated herein a? DNA56050-1455. The entire nucleotide sequence of DNA56050-1455 is shown in Figure 
30 87 (SEQ ID NO: 150). Clone DNA56050-1455 contains a single open reading frame with an apparent 
translational initiation site at nucleotide positions 183-185 and ending at the stop codon at nucleotide positions 
861-863 (Figure 87). The predicted polypeptide precursor is 226 amino acids long (Figure 88). The full-length 
PRO1079 protein shown in Figure 88 has an estimated molecular weight of about 24,61 1 Daltons and a pi of 
about 4.85. Analysis of the full-length PRO1079 sequence shown in Figure 88 (SEQ ID NO:3) evidences the 
35 presence of the following features: a signal peptide at about amino acid 1-29; potential N-myristoylation sites 
at about amino acids 10-15, and 5 1-56; homology to photosystem I psaG and psaK proteins at about amino acids 
2 to 20; and homology to prolyl endopeptidase family serine proteins at about amino acids 150 to 163. 



407 



BNSDOCID: <WO„ 9963088A2 _L> 



WO 99/63088 



PCT/US99/12252 



Analysis of the amino acid sequence of the full-length PRO 1079 polypeptide using the Dayhoff database 
(version 35.45 SwissProt 35) evidenced some sequence identity between the PRO 1079 amino acid sequence and 
the following Dayhoff sequences: CEK10C3_4, MMU50734J, D69503, AF051 149 J, and VSMP CVMS. 

Clone UNQ536 (DNA56050-1455) was deposited with the ATCC on June 22, 1998, and is assigned 
ATCC deposit no. 20301 1 . 

5 

EXAMPLE 37 : Isolation of cDN A clones Encoding Human PRQ793 

A cDNA clone (DNA561 10-1437) encoding a native human PR0793 polypeptide was identified by a 
yeast screen, in a human skin tumor cDNA library that preferentially represents the 5* ends of the primary 
cDNA clones. The yeast screen employed identified a single EST clone designated herein as DN A50177 (Figure 

10 91 ; SEQ ID NO: 154). The DNA50177 seauence was then compared to various EST databases including public 
EST databases (e.g. t GenBank), and a proprietary EST database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, 
CA) to identify homologous EST sequences. The comparison was performed using the computer program 
BLAST or BLAST2 [Aitschul et al., Methods in Enzvmology. 266:460-480 (1996)]. Those comparisons 
resulting in a BLAST score of 70 (or in some cases, 90) or greater that did not encode known proteins were 

15 clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of 
Washington, Seattle, Washington). This consensus sequence is herein designated DNA50972, 

In light of an observed sequence homology between the DNA50972 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. N33874, the Merck EST clone N33874 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

20 The sequence of this cDNA insert is shown in Figure 89 and is herein designated as DNA561 10-1437. 

The full-length DNA56 110- 1437 clone shown in Figure 89 contains a single open reading frame with 
an apparent translational initiation site at nucleotide positions 77-79 and ending at the stop codon at nucleotide 
positions 491-493 (Figure 89). The predicted polypeptide precursor is 138 amino acids long (Figure 90). The 
full-length PR0793 protein shown in Figure 90 has an estimated molecular weight of about 15,426 daltons and 

25 a pi of about 10.67. Analysis of the full-length PR0793 sequence shown in Figure 90 (SEQ ID NO: 153) 
evidences the presence of the following: transmembrane domains from about amino acid 12 to about amino acid 
30, from about amino acid 33 to about amino acid 52, from about amino acid 69 to about amino acid 89 and 
from about amino acid 93 to about amino acid 109, potential N-myristolation sites from about amino acid 11 to 
about amino acid 16, from about amino acid 51 to about amino acid 56 and from about amino acid 1 16 to about 

30 amino acid 121 and an amino acid sequence block having homology to an aminoacyl-transfer RNA synthetase 
class-II protein from about amino acid 49 to about amino acid 59. Clone DNA561 10-1437 has been deposited 
with ATCC on August 11, 1998 and is assigned ATCC deposit no. 2031 13. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 90 (SEQ ID NO: 153), evidenced certain 

35 homology between the PR0793 amino acid sequence and the following Dayhoff sequences: S47453, 
AF015193J2, MTEHGNS9_2, E64030, H69784, D64995, CD53 MOUSE, GEN8006, AE001138 7 and 
COX2_STRPU. 
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EXAMPLE 38 : Isolation of cDNA Clones Encoding Human PRO 10 16 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example I above. The consensus sequence obtained is herein designated DNA53502. 

In light of an observed sequence homology between the DNA53502 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 38680, the Merck EST clone 38680 was purchased and 
5 the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 92. 

The entire nucleotide sequence of DNA561 13-1378 is shown in Figure 92 (SEQ ID NO: 155). Clone 
DNA561 13-1378 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 168-170 and ending at the stop codon at nucleotide positions 1302-1304 (Figure 92). The predicted 
10 po 1 peptide precursor is 378 amino acids long (Figure 93). The full-length PRO 1016 protein shown in Figure 
S'3 has an estimated molecular weight of about 44,021 daltons and a pi of about 9.07. Clone DNA561 13-1378 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PRO 10 16 polypeptide suggests that portions of 
15 it possess sequence identity with acyltransferase, thereby indicating that PRO 1016 may be a novel 
acyltransferase. 

Still analyzing the amino acid sequence of SEQ ID NO: 156, the putative signal peptide is at about amino 
acids 1-18 of SEQ ID NO: 156. The transmembrane domain(s) are at about amino acids 332-352 and 305-330 
of SEQ ID NO: 156. The fructose-bisphosphate aldolase class-II protein homology sequence is at about amino 
20 acids 73-90 of SEQ ID NO: 156. The extradiol ring-cleavage dioxygenase protein is at about amino acids 252- 
275 of SEQ ID NO: 156. The corresponding nucleotides can be routinely determined given the sequences 
provided herein. 

The specific Dayhoff database designation names of sequences to which PRO 10 16 has sequence identity 
with include the following: S52645, P R59712, PJR99249, P R59713, BNAGPATRFl, CELT05H4 15 and 
25 CELZK40J. 

EXAMPLE 39 : Isolation of cDNA Encoding Human PRO1013 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above, ine consensus DNA sequence was then extended using repeated cycles of BLAST and 

30 phrap to extend the consensus sequence as far as possible using the sources of EST sequences. 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Incyte EST clone no. 3107695, the Incyte EST clone 3107695 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 94 and is herein designated as DNA56410-1414. 

35 The entire nucleotide sequence of DNA56410-1414 is shown in Figure 94 (SEQ ID NO: 157). Clone 

DNA56410-1414 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 17-19 and ending at the stop codon at nucleotide positions 1244-1246 (Figure 94). The predicted 
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polypeptide precursor is 409 amino acids long (Figure 95). The full-length PRO 1013 protein shown in Figure 
95 has an estimated molecular weight of about 46,662 daltons and a pi of about 7.18. Clone DNA56410-1414 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Stilt analyzing the amino acid sequence of SEQ ID NO: 158, the putative signal peptide is at about amino 
v# 5 acids 1-19 of SEQ ID NO: 158, N-giycosylation sites are at about amino acids 75-78 and 322-325 of SEQ ID 

NO: 158. An N-myristpylation site is at about amino acids 184- 189 of SEQ ID NO: 158. A growth factor and 
g cytokine receptor family domain is at about amino acids 134-149 of SEQ ID NO: 158. The corresponding 

i£ nucleotides can be routinely determined given the sequences provided herein. 

Blast analysis showed some sequence identity with other proteins. Specifically, PRO 1013 has some 
10 sequence identity with at least the Dayhoff sequences designated: D63877 1; MHU22019_1 , AE000730_10, and 
AF019079 1. 

EXAMPLE 40 : Isolation of cDNA Clones Encoding Human PRQ937 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
15 in Example 1 above. That consensus sequence is herein designated DNA49651. Based on the DNA49651 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
S| the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 

^ PR0937. 

PCR primers (forward and reverse) were synthesized: 
20 forward PCR primer 5' CTCCGTGGTAAACCCCACAGCCC-3' (SEQ ID NO: 161); and 
reverse PCR primer 5'-TCACATCGATGGGATCCATGACCG-3' (SEQ ID NO: 162). 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA48651 sequence 
which had the following nucleotide sequence: 
hybridization probe 

25 5'-GGTCTCGTGACTGTGAAGCCATGTTACAACTACTGCTCAAACATCATGAG-3' (SEQ ID NO: 163). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0937 gene using the probe oligonucleotide and one of the PCR primers. RNA 
^ for construction of the cDNA libraries was isolated from human fetal kidney tissue (LIb227), 

30 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PR0937 [herein designated as DNA56436-1448] (SEQ ID NO: 159) and the derived protein sequence for 
PR0937. 

The entire nucleotide sequence of DNA56436-1448 is shown in Figure 96 (SEQ ID NO: 159). It 
contains a single open reading frame having an apparent translational initiation site at nucleotide positions 499- 
35 501 and ending at the stop codon found at nucleotide positions 2167-2169 (Figure 96, SEQ ID NO: 159). The 
predicted polypeptide precursor is 556 amino acids long, has a calculated molecular weight of approximately 
— 62,412 daltons and an estimated pi of approximately 6.62. Analysis of the full-length PR0937 sequence shown 
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in Figure 97 (SEQ ID NO: 160) evidences the presence of the following features: signal peptide at about amino 
acids 1-22; ATP/GTP-binding site motif A (P-loop) at about amino acids 515-523; a potential N-glycosylation 
site at about amino acids 514-517; and sites of glypican homology at about amino acids 54-74, 106-156, 238- 
279, 309-345, 423-459, and 468-505. 

Clone DNA56436-1448 has been deposited with ATCC on May 27, 1998, and is assigned ATCC 
5 deposit no. 209902. 

Analysis of the amino acid sequence of the full-length PR0937 polypeptide suggests that it possesses 
significant sequence similarity to glypican proteins, thereby indicating that PR0937 may be a novel glypican 
protein. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
significant homology between the PR0937 amino acid sequence and the following Dayhoff sequences: 
10 GPCK_MOUSE, GPC2JRAT, GPC5 HUMAN, GPC3_HUMAN, P__R30168, CEC03H12__2, GEN13820, 
HS119E231, HDACDROME, and AF017637J. 

EXAMPLE 41 : Isolation of cDNA clones Encoding Human PRQ842 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

15 Incyte EST cluster sequence designated herein as Incyte EST cluster sequence no. 69572. This EST cluster 
sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST 
databases (e.g. , GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, 
CA) to identify existing homologies. The homology search was performed using the computer program BLAST 
or BLAST2 (Altshul et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a 

20 BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA54230. 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Merck EST clone no. AA477092, the Merck EST clone AA477092 was purchased and 

25 the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 98 and is herein designated as DNA56855-1447. 

The full length clone shown in Figure 98 contained a single open reading name with an apparent 
translational initiation site at nucleotide positions 153-155 and ending at the stop codon found at nucleotide 
positions 510-512 v Figure 98; SEQ ID NO: 164). The predicted polypeptide precursor (Figure 99, SEQ ID 

30 NO: 165) is 119 amino acids long. PR0842 has a calculated molecular weight of approximately 13,819 Daltons 
and an estimated pi of approximately 11.16. Other features of PR0842 include a signal peptide at about amino 
acids 1-22, a potential protein kinase C phosphorylation site at about amino acids 39-41 and two potential N- 
myristoy lation sites at about amino acids 27-32 and about amino acids 46-5 1 . 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 

35 alignment analysis of the full-length sequence shown in Figure 98 (SEQ ID NO: 164), evidenced some homology 
between the PR0842 amino acid sequence and the following Dayhoff sequences: CEZK13111, PR80843, 
RAT5HT2X_1, S81882J, A60912, MCU60315 J37MC137L, U93422J, p_P91996, U93462J, and 
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ZN18HUMAN. 

Clone DNA56855-1447 was deposited with the ATCC on June 23, 1998, and is assigned ATCC deposit 
no. 203004. 

EXAMPLE 42: Isolation of cDNA clones Encoding Human PRQ839 
5 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte LIFESEQ® database, designated Incyte EST Cluster No. 24479. This EST 
cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included 
public EST databases (e,g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte 
Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using 
10 the computer program BLAST or BLAST2 ( Altshul et al . , Methods in Enzymology 266:460-480 (1996)). Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 
proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, 
University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein 
designated DNA55709, 

15 In light of an observed sequence homology between the DNA55709 consensus sequence and an EST 

sequence encompassed within the Merck EST clone no. 754525, the Merck EST clone 754525 was purchased 
and the cDNA insert was obtained and sequenced- It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 100 and is herein designated as DNA56859-1445. 

The full length clone shown in Figure 100 contained a single open reading frame with an apparent 

20 translational initiation site at nucleotide positions 2-4 and ending at the stop codon found at nucleotide positions 
263-265 (Figure 100; SEQ ID NO: 166). The predicted polypeptide precursor (Figure 101, SEQ ID NO: 167) 
is 87 amino acids long. PR0839 has a calculated molecular weight of approximately 9,719 Daltons and an 
estimated pi of approximately 4.67. Other features of PR0839 include a signal peptide at about amino acids 1- 
23, potential protein kinase C phosphorylation sites at about amino acids 37-39 and about amino acids 85-87, 

25 a potential casein kinase II phosphorylation site at about amino acids 37-40, sequence identity with ribonucleotide 
reductase large subunit protein at about amino acids 50-60, and sequence identity with eukaryotic RNA-binding 
region RNP-l proteins at about amino acids 70-79. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 101 (SEQ ID NO: 167), evidenced some 

30 homology between the PR0839 amino acid sequence and the following Dayhoff sequences: CD14MOUSE, 
XPR6_YARLI, HS714385J, S49783, BB19 RABIT, GVPH-HALME, AB003135J, P R85453, 
LUU27081_2, and TP2B_MOUSE. 

Clone DNA56859-1445 was deposited with the ATCC on June 23, 1998, and is assigned ATCC deposit 
no.209019. 

35 
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EXAMPLE 43 : Isolation of cDNA Clones Encoding Human PRO 11 80 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence (Incyte EST cluster sequence no. 14732). The Incyte EST cluster sequence no. 
14732 sequence was then compared to a variety of expressed sequence tag (EST) databases which included public 
EST databases (e.g. , GenBank) and a propn<n^ EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo 
5 Alto, CA) to identify existing homologies. The homology search was performed using the computer program 
BLAST or BLAST2 (Altshul et aL, Methods in Enzvmologv 266 :460-480 (1996)). Those comparisons resulting 
in a BLAST score of 70 (or in some cases 90) or greater that did i ot encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "ph^ap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA557U. 

10 In light of an observed sequence hor logy between the DNA5571 1 consensus sequence and an EST 

sequence encompassed within the Merck EST clone no. T60981, the Merck EST clone T60981 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 102 and is herein designated DNA56860-1510. 

The full length clone shown in Figure 102 contained a single open reading frame with an apparent 

1 5 translational initiation site at nucleotide positions 78-80 and ending at the stop codon found at nucleotide positions 
909-91 1 (Figure 102; SEQ ID NO; 168). The predicted polypeptide precursor is 277 amino acids long, has a 
calculated molecular weight of approximately 31,416 daltons and an estimated pi of approximately 8.88. 
Analysis of the full-length PROl 180 sequence shown in Figure 103 (SEQ ID NO: 169) evidences the presence 
of the following: a signal peptide from about amino acid 1 to about amino acid 23, a leucine zipper pattern 

20 sequence from about amino acid 10 to about amino acid 3 1 , and potential N-myristolation sited from about amino 
acid 64 to about amino acid 69, from about amino acid 78 to about amino acid 83, from about amino acid 80 
to about amino acid 85, from about amino acid 91 to about amino acid 96 and from about amino acid 201 to 
about amino acid 206. Clone DNA56860-1510 has been deposited with the ATCC on June 9, 1998 and is 
assigned ATCC deposit no. 209952. 

25 Analysis of the amino acid sequence of the full-length PROl 180 polypeptide suggests that it possesses 

sequence similarity to the methyitransferase family of proteins. More specifically, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced some degree of homology between the PRO 11 80 amino acid 
sequence and the following Dayhoff sequences, MTCI65J4, D69267, YH09 YEAST, BIOC SERMA, 
ATAC00448415T1D16.16, SHGCPIR 18, SPBC3B9_4, AB009504J4, P_W17977 and A69952. 

30 

EXAMPLE 44 : Isolation of cDNA clones Encoding Human PRO U 34 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 7511. This EST cluster sequence was then compared to 
a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and 
35 a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in Enzvmologv 266:460-480 n 9961V Those comparisons resulting in a BLAST score of 70 (or 
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in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap* (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA55725. Two proprietary Genentech EST 
sequences were employed in the assembly and are shown in Figure 106 (SEQ ID NO: 172) and Figure 107 (SEQ 
ID NO: 173). 

5 In light of an observed sequence homology between the DNA55725 consensus sequence and an EST 

sequence encompassed within the Merck EST clone no. H94897, the Merck EST clone H94897 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 106 and is herein designated as DNA56865-I491. 

Clone DNA56865-1491 contains a single open reading frame with an apparent translational initiation 

10 site at nucleotide positions 153-155 and ending at ^ stop codon at nucleotide positions 1266-1268 (Figure 104). 
The predicted polypeptide precursor is 371 amino acids Jong (Figure 105). The full-length PROH34 protein 
shown in Figure 105 has an estimated molecular weight of about 4 1 ,935 daltons and a pi of about 9.58. Analysis 
of the full-length PROH34 sequence shown in Figure 105 (SEQ ID NO: 171) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 23, potential N-glycosylation sites from 

15 about amino acid 103 to about amino acid 106, from about ammo acid 249 to about amino acid 252 and from 
about amino acid 257 to about amino acid 260, and an amino acid block having homology to tyrosinase CuA- 
binding region proteins from about amino acid 280 to about amino acid 306. Clone DNA56865-1491 has been 
deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203022. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 

20 alignment analysis of the full-length sequence shown in Figure 105 (SEQ ID NO: 171), evidenced significant 
homology between the PROH34 amino acid sequence and the following Dayhoff sequences: F20P518, 
AC002396J0, S47847, C64146, GSPA_BACSU, P„W10564, RFAI ECOLI, Y258_HAEIN, RFAJ_SALTY 
and P R32985. 

25 EXAMPLE 45 : isolation of cDNA clones Encoding Human PRO830 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incytedatabase, designated 20251 . This EST cluster sequence was then compared to 
a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and 
a proprietary L^T DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 

30 homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et aL, Methods in Enzvmology 266:460-480 (1996)), Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA55733. 

35 In light of an observed sequence homology between the DNA55733 consensus sequence and an EST 

sequence encompassed within the Merck EST clone no. H78534, the Merck EST clone H78534 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

414 



BNSDOCID: <WO .._ 



.9963088A2 J 



WO 99/63088 



PCT/US99/12252 



The sequence of this cDNA insert is shown in Figure 108 and is herein designated as DNA56866-1342. 

Clone DNA56866-1342 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 154-156 and ending at the stop codon at nucleotide positions 415-417 (Figure 108). 
The predicted polypeptide precursor is 87 amino acids long (Figure 109). The full-length PRO830 protein shown 
in Figure 109 has an estimated molecular weight of about 9 t 272 daltons and a pi of about 9. 19. Analysis of the 
5 full-length PRO830 sequence shown in Figure 109 (SEQ ID NO: 175) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 33, potential N-myristoylation sites from about 
amino acid 2 to about amino acid 7 and from about amino acid 8 to about amino acid 13 and a thioredoxin family 
of proteins homology block from about amino acid 23 to about amino acid 39. Clone UNQ470 (DNA56866- 
1342) has been deposited with ATCC on June 22, 1998 and is assigned ATCC deposit no. 203023. 
10 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 

alignment analysis of the full-length sequence shown in Figure 109 (SEQ ID NO: 175), evidenced significant 
homology between the PRO830 amino acid sequence and the following Dayhoff sequences: HSU88I54_1, 
HSU88153J, SAPKSGENE 1, HPU31791 5, GGCNOT2 1 , CPU91421 1, CHKESTPC091, PQ0769, 
U97553 79andB60095. 
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EXAMPLE 46 : Isolation of cDNA clones Encoding Human PROU 15 



Si Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the LIFESEQ® database, designated Incyte EST cluster sequence no. 165008. This EST 
cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included 

20 public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte 
Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using 
the computer program BLAST or BLAST2 (Altshul et al . , Methods in Enzymology 266:460-480 ( 1 996)) . Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that dW not encode known 
proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, 

25 University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein 
designated DNA55726. 

In light of an observed sequence homology between the DNA55726 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. R75784, the Merck EST clone R75784 was purchased 
?? and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

30 The sequence of this cDNA insert is shown in Figure 1 1 1 and is herein designated as DNA56868-1478. 
,v| The full length clone shown in Figure 1 10 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 189-191 and ending at the stop codon found at nucleotide 
positions 1524-1526 (Figure 110; SEQ ID NO: 176). The predicted polypeptide precursor (Figure 111, SEQ 
ID NO: 177) is 445 amino acids long. PROU15 has a calculated molecular weight of approximately 50,533 
35 Daltons and an estimated pi of approximately 8.26. Additional features include a signal peptide at about amino 
acids 1-20; potential N~glycosylation sites at about amino acids 204-207, 295-298, and 313-316; and putative 
^ transmembrane domains at about amino acids 35-54, 75-97, 126-146, 185-204, 333-350, and 353-371. 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 1 1 1 (SEQ ID NO:177) T evidenced some amino 
acid sequence identity between the PROH15 amino acid sequence and the following Dayhoff sequences: 
AF053947_79, S73698, CEC47A10_4, CCOMTNDS5G J , HS4LMP2ACJ, LMP2JEBV, PA24MOUSE, 
HCU33331J7, P-W05508, and AF002273J. 
5 Clone DNA56868-1478 was deposited with the ATCC on June 23, 1998 and is assigned ATCC deposit 

no. 203024.. 

EXAMPLE 47 : Isolation of cDNA clones Encoding Human PRO 1277 

A consensus DNA sequence was assembled relative to other ESTs using repeated cycles of BLAST and 
10 the program "phrap" as described in Example 1 above. One or more of the ESTs from the assembly was 
derived from diseased coronary artery tissue. The consensus sequence obtained is designated herein as 
"DNA49434". 

In light of an observed sequence homology between the DNA49434 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3042605, the lncyte EST clone 3042605 was purchased 
15 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 112 (SEQ ID NO: 178). 

Clone DNA56869-1545 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 188-190, and an apparent stop codon at nucleotide positions 2222-2224 (Figure 1 12). 
The predicted polypeptide precursor is 678 amino acids long (Figure 113). The full-length PRO 1277 protein 
20 shown in Figure 113 has an estimated molecular weight of about 73,930 daltons and a pi of about 9.48. 
Additional features include a signal peptide at about amino acids 1-26; a transmembrane domain at about amino 
acids 181-200, and potential N-glycosylation sites at about amino acids 390-393 and 520-523. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 113 (SEQ ID NO: 179), revealed significant 
25 homology between the PRO 1277 amino acid sequence and Dayhoff sequence no AF012252_1 . Homology was 
also found between the PR01277 amino acid sequence and the following Dayhoff sequences: AF006740_1, 
CA36_HUMAN, HSU1 1, HUMCOL7A1X 1, CA17 HUMAN, MMZ78163 J, CAMAJTHICK, 
HSU69263J, YNX3 CAEEL, and MMRNAM31. 

Clone DNA56869-1545 has been deposited with ATCC and is assigned ATCC deposit no. 203161. 

30 

EXAMPLE 48 : Isolation of cDNA Clones Encoding Human PRQ1135 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA52767. Based on the DNA52767 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR acDNA library that contained 
35 the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PROH35. 
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In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with PCR primer pairs prepared based upon the DNA52767 sequence. A positive 
library was then used to isolate clones encoding the PRO 1 135 gene using the probe oligonucleotide and one of 
the PCR primers. RNA for construction of the cDNA libraries was isolated from human coronary artery smooth 
muscle tissue (LIB309). The cDNA libraries used to isolate the cDNA clones were constructed by standard 
5 methods using commercially available reagents such as those from Invitrogen, San Diego, CA. The cDNA was 
primed with oligo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, 
sized appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such 
as pRKB or pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al., 
Science , 253:1278-1280 (1991)) in the unique Xhol and Not! sites. 
10 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PR01135 [herein designated as DNA56870-1492] (SEQ ID NO: 180) and the derived protein sequence for 
PROH35. 

The entire nucleotide sequence of DNA56870-1492 is shown in Figure 1 14 (SEQ ID NO: 180). Clone 
DNA56870-1492 contains a single open reading frame with an apparent translational initiation site at nucleotide 

15 positions 62-64 and ending at the stop codon at nucleotide positions 1685-1687 (Figure 114). The predicted 
polypeptide precursor is 541 amino acids long (Figure 1 15). The full-length PRO 1 135 protein shown in Figure 
115 has an estimated molecular weight of about 60,335 daltons and a pi of about 5.26. Analysis of the full- 
length PRO 1135 sequence shown in Figure 1 15 (SEQ ID NO: 181) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about aino acid 21, potential N-glycosylation sited from about amino 

20 acid 53 to about amino acid 56, from about amino acid 75 to about amino acid 78, from about amino acid 252 
to about amino acid 255 and from about amino acid 413 to about amino acid 4 16 and an amino acid block having 
homology to glycosyl hydrolase family 35 proteins from about amino acid 399 to about amino acid 414. Clone 
DNA56870-1492 has been deposited with ATCC on June 2, 1998 and is assigned ATCC deposit no. 209925. 
Analysis of the amino acid sequence of the full-length PROl 135 polypeptide suggests that it possesses 

25 significant sequence similarity to the alpha 1 ,2-mannosidase protein, thereby indicating that PROl 135 may be 
a novel mannosidase. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced significant homology between the PROl 135 amino acid sequence and the following Dayhoff 
sequences, DMC86E4_5, D86967_l, SPAC23 Al_4, YH04YEAST, B54408, SSM AN9MAN_1 , CEZC410_4, 
S61631 and MSU14190JL 

30 

EXAMPLE 49 : Isolation of cDNA Clones Encoding Human PROl 1 14 

A cDNA sequence isolated in the amylase screen described in Example 2 above was found, by the WU- 
BLAST-2 sequence alignment computer program, to have certain sequence identity to other known interferon 
receptors. This cDNA sequence is herein designated DNA48466 and is shown in Figure 1 1 8 (SEQ ID NO: 184). 
35 Based on the sequence identity, probes were generated from the sequence of the DNA48466 molecule and used 
to screen a human breast carconoma library (LIB 135) prepared as described in paragraph 1 of Example 2 above. 
The cloning vector was pRK5B (pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes 

417 

BNSDOCID: <WO 9963G88A2J_> 



WO 99/63088 



PCT/US99/12252 



et ah, Science , 253:1278-1280 (1991)), and the cDNA size cut was less than 2800 bp. 

The oligonucleotide probes employed were as follows: 
forward PGR primer 5'-AGGCTTCGCTGCGACTAGACCTC-3 ' (SEQ ID NO: 185) 
reverse PCR primer 5 ' -CC AGGTCGGGT A AGG ATGGTTG AG -3 ' (SEQ ID NO: 186) 
h ybridization probe 

5 5'-TTTCTACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGC-3' (SEQ ID NO: 187) 
A full length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 250-252, and a stop signal at nucleotide positions 1183-1185 
(Figure 116, SEQ ID NO: 182). The predicted polypeptide precursor is 31 1 amino acids long, has a calculated 
molecular weight of approximately 35,076 daltons and an estimated pi of approximately 5.04. Analysis of the 

10 full-length PROl 1 14 interferon receptor sequence shown in Figure 1 1 7 (SEQ ID NO: 183) evidences die presence 
of the following: a signal peptide from about amino acid 1 to about amino acid 29, a transmembrane domain 
from about amino acid 230 to about amino acid 255, potential N-giycosylation sites from about amino acid 40 
to about amino acid 43 and from about amino acid 134 to about amino acid 137, an amino acid sequence block 
having homology to tissue factor proteins from about amino acid 92 to about amino acid 1 19 and an amino acid 

1 5 sequence block having homology to integrin alpha chain proteins from about amino acid 232 to about amino acid 
262. Clone DNA57033-1403 has been deposited with ATCC on May 27, 1998 and is assigned ATCC deposit 
no. 209905. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 1 17 (SEQ ID NO: 183), evidenced significant 
20 homology between the PRO 1 1 14 interferon receptor amino acid sequence and the following Dayhoff sequences: 
G01418, INR1MOUSE, PR71035, INGS HUMAN, A26595 1, A26593 1, 156215 and TF HUMAN. 

EXAMPLE 50 : Isolation of cDNA Clones Encoding Human PRQ828 

A consensus DNA sequence was identified using the method described in Example 1 above. This 
25 consensus sequence is herein designated DNA35717. Based on the DNA35717 consensus sequence, 

oligonucleotides were synthesized: 1) to identify by PCR acDNA library that contained the sequence of interest, 

and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0828. 
PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5*-GC AGGACTTCTACGACTTCAAGGC-3' (SEQ ID NO: 190); and 
30 reverse PCR primer 5 ' - AGTCTGGGCC AGGT ACTTG A AGGC-3 ' (SEQ ID NO: 191). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA35717 

sequence which had the following nucleotide sequence: 

hybridization probe 

5 ' -CAACATCCGGGGC AA ACTGGTGTCGCTGGAGA AGTACCGCGG ATCGGTGT-3 ' (SEQ ID NO: 192) 
35 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0828 gene using the probe oligonucleotide and one of the PCR primers. RNA 
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for construction of the cDNA libraries was isolated from human fetal lung tissue (LIB25). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0828 [herein designated as DNA57037-1444] (SEQ ID NO: 188) and the derived protein sequence for 
PR0828. 

The entire nucleotide sequence of DNA57037-1444 is shown in Figure 1 19 (SEQ ID NO: 188). Clone 
^ 5 DNA57037-1444 contains a single open reading frame with an apparent translational initiation site at nucleotide 

positions 34-36 and ending at the stop codon at nucleotide positions 595-597 (Figure 119). The predicted 
fl polypeptide precursor is 187 amino acids long (Figure 120). The full-length PR0828 protein shown in Figure 

^ 120 has an estimated molecular weight of about 20,996 daltons and a pi of about 8.62. Analysis of the full- 

length PR0828 sequence shown in Figure 120 (SEQ ID NO: 189) evidences the presence of the following: a 
v, 10 signal peptide at about amino acids 1-21; sequent wo identity to glutathione peroxidases signature 2 at about 

amino acids 82-89; sequence identity to glutathione peroxidases selenocysteine proteins at about amino acids 35- 
60, 63-100, 107-134, and 138-159. Clone DNA57037-1444 has been deposited with ATCC on May 27, 1998, 
and is assigned ATCC deposit no. 209903. 

Analysis of the amino acid sequence of the full-length PRO 828 polypeptide suggests that it possesses 
15 significant sequence similarity to glutathione peroxidases, thereby indicating that PR0828 may be a novel 
peroxidase enzyme. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
% evidenced sequence identity between the PR0828 amino acid sequence and the following Dayhoff sequences: 

^ AF053311J, CELT09A12_2, AC004151J3, BTUEJSCOLI, CER05H10J3, PJ>80918, PWU88907_1, and 

P_W22308. 

20 

EXAMPLE 51 : Isolation of cDNA clones Encoding Human PRO 1009 

A cDNA clone (DNA57129-1413) encoding a native human PRO1009 polypeptide was identified by 
the use of a yeast screen, in a human SK-Lu-1 adenocarcinoma cell line cDNA library that preferentially 
represents the 5' ends of the primary cDNA clones. First SEQ ID NO: 195 (Figure 123) was identified, which 
25 was extended by alignments to other EST sequences to form a consensus sequence. Oligonucleotide probes 
based upon the consensus sequence were synthesized and used to screen the cDNA library which gave rise to 
the full-length DNA57129-1413 clone. 

The full length DNA57129-1413 clone shown in Figure 121 contained a single open reading frame with 
^ an apparent translational initiation site at nucleotide positions 41-43 and ending at the stop codon found at 

30 nucleotide positions 1886-1888 (Figure 121; SEQ ID NO:193). The predicted polypeptide precursor (Figure 
>| 122, SEQ ID NO: 194) is 615 amino acids long. Figure 122 also shows the approximate locations of the signal 

sequence, transmembrane domains, myristoylation sites, a glycosyiation site and an AMP-binding domain. 
PRO1009 has a calculated molecular weight of approximately 68,125 daltons and an estimated pi of 
approximately 7.82. Clone DNA57129-1413 has been deposited with ATCC and is assigned ATCC deposit no. 
35 209977. It is understood that the deposited clone has the actual and correct sequence and that the representations 
herein may have minor, normal sequencing errors. 
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Based on a WU-BLAST-2 sequence alignment analysis (using the ALIGN computer program) of the 
full-length sequence, PRO 1009 shows amino acid sequence identity to at least the following proteins which were 
designated in a Dayhoff database as follows: F69893, CEF28F8 2, BSY13917J7, BSY13917J7, D69187, 
D69649, XCRPFBJ, E64928, YDID ECOLI, BNACSF8J and RPU75363J2, 

5 EXAMPLE 52 : Isolation of cDNA Clones Encoding Human PRO 1007 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated as DNA40671. 

In light of an observed sequence homology between the DNA40671 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. T70513, the Merck EST clone T70513 was purchased 
10 and „*j cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 124. 

The entire nucleotide sequence of DNA57690-1374 is shown in Figure 124 (SEQ ID NO: 196). Clone 
DNA57690-1374 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 16-18 and ending at the stop codon at nucleotide positions 1054-1056 (Figure 124). The predicted 
15 polypeptide precursor is 346 amino acids long (Figure 125). The full-length PRO 1007 protein shown in Figure 
125 has an estimated molecular weight of about 35,971 daltons and a pi of about 8.17. Clone DNA57690-1374 
has been deposited with the ATCC on June 9, 1998. Regarding the sequence , it is understood that the deposited 
clone contains the actual sequence, and the sequences provided herein are based on known sequencing 
techniques. The representative figures herein show the representative numbering. 
20 Analysis of the amino acid sequence of the full-length PRO 1007 polypeptide suggests that portions of 

it possess sequence identity to MAGPIAP, thereby indicating that PRO 1007 may be a novel member of the 
family to which MAGPIAP belongs. . 

Still analyzing the amino acid sequence of SEQ ID NO: 197, the putative signal peptide is at about amino 
acids 1-30 of SEQ ID NO: 197. The transmembrane domain is at amino acids 325-346 of SEQ ID NO: 197. N- 
25 glycosylation sites are at about amino acids 118-121, 129-132, 163-166, 176-179, 183-186 and 227-130 of SEQ 
ID NO: 197. Ly-6/u-Par domain protein homology is at about amino acids 17-36 and 209-222 of SEQ ID 
NO: 197. The corresponding nucleotides of the amino acids presented herein can be routinely determined given 
the sequences provided herein. 

30 EXAMPLE 53 : Isolation of cDNA clones Encoding Human PRO 1056 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the lncyte database, designated herein as 6425. This EST cluster sequence was then 
compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., 
GenBank) and a proprietary EST DNA database (Lifeseq®, lncyte Pharmaceuticals, Palo Alto, CA) to identify 

35 existing homologies. The homology search was performed using the computer program BLAST or BLAST2 
(Altshul et aL, Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
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consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA55736. 

In light of an observed sequence homology between the DNA55736 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. R88049, the Merck EST clone R88049 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
5 The sequence of this cDNA insert is shown in Figure 126 and is herein designated as DNA57693-1424. 

Clone DNA57693-1424 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 56-58 and ending at the stop codon at nucleotide positions 416-418 (Figure 126), The 
predicted polypeptide precursor is 120 amino acids long (Figure 127). The full-length PRO1056 protein shown 
in Figure 127 has an estimated molecular weight of about 13,345 daltons and a pi of about 5.18. Analysis of 

10 the full-length PRO1056 sequence shown in Figure 127 (SEQ ID NO: 199) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 1 8, a transmembrane domain from about 
amino acid 39 to about amino acid 58, a potential N-glycosylation site from about amino acid 86 to about amino 
acid 89, protein kinase C phosphorylation sites from about amino acid 36 to about amino acid 38 and from about 
amino acid 58 to about amino acid 60, a tyrosine kinase phosphorylation site from about amino acid 25 to about 

15 amino acid 32 and an amino acid sequence block having homology to channel forming colic in proteins from 
about amino acid 24 to about amino acid 56. Clone DNA57693- 1424 has been deposited with ATCC on June 
23, 1998 and is assigned ATCC deposit no. 203008. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 127 (SEQ ID NO: 199), evidenced significant 

20 homology between the PRO1056 amino acid sequence and the following Dayhoff sequences: PLMHUMAN, 
A40533, ATNG HUMAN, A55571, ATNG SHEEP, S31524, GEN13025, RIC MOUSE, A48678 and 
A10871J. 

EXAMPLE 54 : Isolation of cDNA clones Encoding Human PRQ826 

25 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte database, designated 47283, This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 

30 et al . , Methods in Enzvmolo gv 266:460-480 ( 1 996)) . Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56O00. 

In light of an observed sequence homology between the DNA56000 consensus sequence and an EST 

35 sequence encompassed within the Merck EST clone no. W69233, the Merck EST clone W69233 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 128 and is herein designated as DNA57694-1341. 
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Clone DNA57694-1341 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 13-15 and ending at the stop codon at nucleotide positions 310-3 12 (Figure 128). The 
predicted polypeptide precursor is 99 amino acids long (Figure 129). The full-length PR0826 protein shown 
in Figure 129 has an estimated molecular weight of about 1 1,050 daltons and a pi of about 7.47. Analysis of 
the full-length PR0826 sequence shown in Figure 129 (SEQ ID NO: 201) evidences the presence of the 
5 following: a signal peptide from about amino acid I to about amino acid 22, potential N-myristoylation sites from 
about amino acid 22 to about amino acid 27 and from about amino acid 90 to about amino acid 95 and an amino 
acid sequence block having homology to peroxidase from about amino acid 16 to about amino acid 48. Clone 
DNA57694-1341 has been deposited with ATCC on June 22, 1998 and is assigned ATCC deposit no, 203017. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
10 alignment analysis of the full-length sequence shown in Figure 129 (SEQ ID NO: 201), evidenced significant 
homology between the PR0826 amino acid sequence and the fci -owing Dayhoff sequences: CCU12315_1, 
SCU96108_6, CELF39F104 and HELTHELHO. 

EXAMPLE 55 : Isolation of cDNA clones Encoding Human PRQ819 

15 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte database, designated 49605. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ® Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 

20 et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56015. 

In light of an observed sequence homology between the DNA56015 consensus sequence and an EST 

25 sequence encompassed within the Merck EST clone no. H65785, the Merck EST clone H65785 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 130 and is herein designated as DNA57695-1340. 

Clone DNA57695-1340 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 46-48 and ending at the stop codon at nucleotide positions 202-2C * (Figure 130). The 

30 predicted polypeptide precursor is 52 amino acids long (Figure 131). The full-length PR0819 protein shown 
in Figure 131 has an estimated molecular weight of about 5,216 daltons and a pi of about 4.67. Analysis of the 
full-length PR0819 sequence shown in Figure 131 (SEQ ID NO:203) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 24, a potential N-myristoylation site from about 
amino acid 2 to about amino acid 7 and a region having homology to immunoglobulin light chain from about 

35 amino acid 5 to about amino acid 33. Clone DNA57695-1340 has been deposited with ATCC on June 23, 1998 
and is assigned ATCC deposit no. 203006. 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 131 (SEQ ID NO:203), evidenced significant 
homology between the PR0819 amino acid sequence and the following Dayhoff sequences: HSU03899_1, 
HUMIGLITEB l , VG28_HSVSA, AF031522_1, PAD 1_ YEAST and AF045484J. 

5 EXAMPLE 56 : Isolation of cDNA Clones Encoding Human PRO 1006 

An initial candidate sequence from Incyte cluster sequence no. 45748 was identified using the signal 
algorithm process described in Example 3 above. This sequence was then aligned with a variety of public and 
Incyte EST sequences and a consensus sequence designated herein as DNA56036 was derived therefrom. 

In light of an observed sequence homology between the DNA56036 consensus sequence and an EST 
10 sequence encompassed within the Merck EST clone no. 489737, the Merck EST clone 489737 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 132. 

The entire nucleotide sequence of DNA57699-1412 is shown in Figure 132 (SEQ ID NO:204). Clone 
DNA57699-1412 contains a single open reading frame with an apparent translational initiation site at nucleotide 
15 positions 28-30 and ending at the stop codon at nucleotide positions 1204-1206 (Figure 132). The predicted 
polypeptide precursor is 392 amino acids long (Figure 133). The full-length PRO1006 protein shown in Figure 
133 has an estimated molecular weight of about 46, 189 daltons and a pi of about 9.04. Clone DNA57699-14 12 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques, 
20 Analyzing the amino acid sequence of SEQ ID NO:205, the putative signal peptide is at about amino 

acids 1-23 of SEQ ID NO:205. The N-glycosylation sites are at about amino acids 40-43, 53-56, 204-207 and 
373-376 of SEQ ID NO:205. An N-myristoylation site is at about amino acids 273-278 of SEQ ID NO:205. 

The corresponding nucleotides of these amino acid regions and others can be routinely determined given the 
sequences provided herein. 

25 

EXAMPLE 57 : Isolation of cDNA Clones Encoding Human PRO 11 12 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a specific 
EST cluster sequence. This EST cluster sequence was then compared to a variety of expressed sequence tag 
(EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database 

30 (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, C A) to identify existing homologies. The homology search was 
performed using the computer program BLAST or BLAST2 (Altshul et ah, Methods in Enzvmology 266:460- 
480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not 
encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" 
(Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is 

35 herein designated DNA56018. 

In light of an observed sequence homology between the DNA56018 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. AA223546, the Merck EST clone AA223546 was 
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purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 
protein. The sequence of this cDNA insert is shown in Figure 134 and is herein designated as DNA57702-1476. 

The entire nucleotide sequence of DNA57702-1476 is shown in Figure 134 (SEQ ID NO:206). Clone 
DNA57702-1476 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 20-22 and ending at the stop codon nucleotide positions 806-808 of SEQ ID NO:206 (Figure 134). 
5 The predicted polypeptide precursor is 262 amino acids long (Figure 135). The full-length PRO 1 112 protein 
shown in Figure 135 has an estimated molecular weight of about 29,379 daltons and a pi of about 8.93. Figure 
135 also shows the approximate locations of the signal peptide and ransmernbrane domains. Clone DNA57702- 
1476 has been deposited with the ATCC on June 9, 1998. It is unc^rstood that the deposited clone has the actual 
nucleic acid sequence and that the sequences provided herein are based on known sequencing techniques. 

10 Analysis of the amino acid sequence c*" he full-length PROl 1 12 polypeptide suggests that it possesses 

some sequence similarity to other proteins. More specifically, an analysis of the Dayhoff database (version 
35.45 SwissProt 35) evidenced some sequence identity between the PROl 1 12 amino acid sequence and at least 
the following Dayhoff sequences, MTY20B1113 (a mycobacterium tuberculosis peptide), F64471, 
AE000690 6, XLU16364J, E43259 (H+ -transporting ATP synthase) and PIGSLADRXEJ (MHC class II 

15 histocompatibility antigen). 

EXAMPLE 58 : Isolation of cDNA clones Encoding Human PRO 1074 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence (Incyte cluster sequence No. 42586). This cluster sequence was then compared to 

20 a variety of expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and 
a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al. , Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 

25 DNA sequence with the program "phrap" (Phil Green, Univ. of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DN A5625 1 . 

In light of an observed sequence homology between the DNA56251 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. AA081912, the Merck EST clone AA081912 was 
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 

30 protein. The sequence of this cDNA insert is shown in Figure 136 and is the full-length DNA sequence for 
PRO1074. Clone DNA57704-1452 was deposited with the ATCC on June 9, 1998, and is assigned ATCC 
deposit no. 209953. 

The entire nucleotide sequence of DNA57704-1452 is shown in Figure 136 (SEQ ID NO:208). Clone 
DN A57704- 1452 contains a single open reading frame with an apparent translational initiation site at nucleotide 
35 positions 322-324 and ending at the stop codon at nucleotide positions 13 15-1317 (Figure 136). The predicted 
polypeptide precursor is 331 amino acids long (Figure 137). The full-length PRO 1074 protein shown in Figure 
137 has an estimated molecular weight of about 39,512 Daltons and a pi of about 8.03. Analysis of the full- 
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length PRO 1074 sequence shown in Figure 137 (SEQ ID NO; 209) evidences the presence of the following 
features: a transmembrane domain at about amino acids 20 to 39; potential N-glycosylation sites at about amino 
acids 72 to 75, 154 to 157, 198 to 201 , 212 to 215, and 326 to 329; a glycosaminoglycan attachment site at about 
amino acids 239 to 242, and a Ly-6/u-PAR domain at about amino acids 23 to 36. 

Analysis of the amino acid sequence of the full-length PRO 1074 polypeptide suggests that it possesses 
5 significant sequence similarity to beta 1 ,3-galactosyltransf erase , thereby indicating that PRO 1 074 may be a novel 
member of the galactosyltransferase family of proteins. Analysis of the amino acid sequence of the full-length 
PRO 1074 polypeptide using the Dayhoff database (version 35,45 SwissProt 35) evidenced homology between 
the PRO 1074 amino acid sequence and the following Dayhoff sequences: AF029792 1, PR57433, 
DMU41449J, AC000348J4, P R47479, CET09F5_2, CEF14B6_4, CET15D6_5, CEC54C8_4, and 
10 CEE0jri4J0. 

Clone DNA57704-1452 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit 
no. 209953. 

EXAMPLE 59 : Isolation of cDNA clones Encoding Human PRO 1005 

15 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the LIFESEQ® database, Incyte cluster sequence no. 49243. This EST cluster sequence 
was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g,, GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 

20 BLAST2 (Altshul et aL, Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56380. 

In light of an observed sequence homology between the DNA56380 consensus sequence and an EST 

25 sequence encompassed within the Merck EST clone no. AA256657, the Merck EST clone AA256657 was 
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 
protein. The sequence of this cDNA insert is shown in Figure 1 38 and is herein designated as DNA57708- 141 1 . 

The full length clone shown in Figure 138 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 30-32 and ending at the stop codon found at nucleotide positions 

30 585-587 (Figure 138; SEQ ID NO:210). The predicted polypeptide precursor (Figure 139, SEQ ID NO:211) 
is 185 amino acids long. PRO 1005 has a calculated molecular weight of approximately 20,331 daltons and an 
estimated pi of approximately 5.85. Clone DNA57708-141 1 was deposited with the ATCC June 23, 1998, and 
is assigned ATCC deposit no. 203021. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

35 alignment analysis of the full-length sequence shown in Figure 139 (SEQ ID NO:211), evidenced some 
homology between the PRO1005 amino acid sequence and the following Dayhoff sequences: DDU07187 1, 
DDU87912J, CELD1007J4, A42239, DDU42597J, CYAG DICDI, S50452, MRKCKLEPN, P-R41998, 
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and XYNARUMFL. 

EXAMPLE 60 : Isolation of cDNA clones Encoding Human PRO 1073 

An initial DNA sequence referred to herein as DNA55938 and shown in Figure 142 (SEQ ID NO:214) 
was identified using a yeast screen, in a human SK-Lu-1 adenocarcinoma cell line cDNA library that 
5 preferentially represents the 5* ends of the primary cDNA clones. DNA55938 was then compared to ESTs from 
public databases (e.g., GenBank), and a proprietary EST database (LIFESEQ®, Incyte Pharmaceuticals, Palo 
Alto, CA), using the computer program BLAST or BLAST2 [ Altschul et al. , Methods in Enzvmology. 266:460- 
480 (1996)]. The ESTs were clustered and assembled into a consensus DNA sequence using the computer 
program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
10 obtained is designated herein as DNA56411. 

In light of an observed sequence homology between the DNA5641 1 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. H86027, the Merck EST clone H 86027 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 140. 
15 The full length DNA577 10- 145 1 clone shown in Figure 1 40 contained a single open reading frame with 

an apparent translational initiation site at nucleotide positions 345-347 and ending at the stop codon found at 
nucleotide positions 1242-1244 (Figure 140; SEQ ID NO:212). The predicted polypeptide precursor (Figure 
141, SEQ ID NO:213) is 299 amino acids long. PRO1073 has a calculated molecular weight of approximately 
34,689 daltons and an estimated pi of approximately 11.49. The PRO 1073 polypeptide has the following 
20 additional features: a signal peptide at about amino acids 1-31 , sequence identity to bZIP transcription factor 
basic domain signature at about amino acids, a potential N-gly cosy lat ion site at about amino acids 2-5, and 
sequence identity with protamine PI proteins at about amino acids 158-183. 

An analysis of the Dayhoff database (version 35.45 Swiss Prot 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 141 (SEQ ID NO:213), revealed some sequence 
25 identity between the PRO1073 amino acid sequence and the following Dayhoff sequences: MMU37351_1, 
ATAC00250510T9J22.10, S59043, ENXNUPRJ, B47328, SR55JDROME, S26650, SON HUMAN, 
VIT2_CHICK, and XLC4SRPRT 1. 

Clone DNA57710-1451 was deposited with the ATCC on July 1 , 1998 and is assigned ATCC deposit 
no. 203048. 

30 

EXAMPLE 61 isolation of cDNA clones Encoding Human PROl 152 

A cDNA clone (DNA5771 1-1501) encoding a native human PROl 152 polypeptide was identified by 
employing a yeast screen, in a human infant brain cDNA library that preferentially represents the 5' ends of the 
primary cDNA clones. Specifically, a yeast screen was employed to identify a cDNA designated herein as 
35 DNA55807 (SEQ ID NO:217; see Figure 145). 

In light of an observed sequence homology between the DNA55807 sequence and an EST sequence 
encompassed within the Merck EST clone no. R56756, the Merck EST clone R56756 was purchased and the 
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cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 143. 

The full-length DNA5771 1-1501 clone shown in Figure 143 contains a single open reading frame with 
an apparent translational initiation site at nucleotide positions 58-60 and ending at the stop codon at nucleotide 
positions 1495-1497 (Figure 143). The predicted polypeptide precursor is 479 amino acids long (Figure 144). 
5 The full-length PRO 1 152 protein shown in Figure 144 has an estimated molecular weight of about 53,602 daltons 
and a pi of about 8.82. Analysis of the full-length PROl 152 sequence shown in Figure 144 (SEQ ID NO:216) 
evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 28, 
transmembrane domains from about amino acid 133 to about amino acid 155, from about amino acid 168 to 
about amino acid 187, from about amino acid 229 to about amino acid 247, from about amino acid 264 to about 
10 amino acid 285, from about amino acid 309 to about amino acid 330, from about amino acid 371 to about amino 
acid 390 and from about amino acid 441 to about amino acid 464, potential N-glycosylation sites from about 
amino acid 34 to about amino acid 37 and from about amino acid 387 to about amino acid 390 and an amino acid 
sequence block having homology to a respiratory-chain NADH dehydrogenase subunit from about amino acid 
243 to about amino acid 287. Clone DNA5771 1-1501 has been deposited with ATCC on July 1, 1998 and is 
15 assigned ATCC deposit no. 203047. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
H| alignment analysis of the full-length sequence shown in Figure 144 (SEQ ID NO:216), evidenced significant 

homology between the PROl 152 amino acid sequence and the following Dayhoff sequences: AF052239_1, 
SYNN9CGA_1, SFCYTB2J, GEN12507, PJU1769, MTV025J09, C61168, S43171, P__P61689 and 
20 P P61696. 

EXAMPLE 62 : isolation of cDNA clones Encoding Human PROl 136 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 109142. This EST cluster sequence was then compared 
25 to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
30 DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56039. 

In light of an observed sequence homology between the DNA56039 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. HSC1NF011, the Merck EST clone HSC1NF01 1 was 
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 
35 protein. The sequence of this cDNA insert is shown in Figure 146 and is herein designated as DNA57827-1493. 

Clone DNA57827-1493) contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 216-218 and ending at the stop codon at nucleotide positions 21 12-2 i 14 (Figure 146). 
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The predicted polypeptide precursor is 632 amino acids long (Figure 147). The full-length PROl 136 protein 
shown in Figure 147 has an estimated molecular weight of about 69,643 daltons and a pi of about 8.5. Analysis 
of the full-length PROl 136 sequence shown in Figure 147 (SEQ ID NO:219) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 15 and potential N-glycosylation sites 
from about amino acid 108 to about amino acid 11, from about amino acid 157 to about amino acid 160, from 
5 about amino acid 289 to about amino acid 292 and from about amino acid 384 to about amino acid 387. Clone 
DNA57827-1493 has been deposited with ATCC on July 1, 1998 and is assigned ATCC deposit no. 203045. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 147 (SEQ ID NO:2I9), evidenced significant 
homology between the PROl 136 amino acid sequence and the following Dayhoff sequences: AF034746 1, 
10 AF034745_1, MMAF000168J9, HSMUPP1J; AF060539 J , SP97 RAT, 138757, MMU93309_1, 
CEK01A6_4 and HSA224747J. 

EXAMPLE 63 : Isolation of cDNA clones Encoding Human PRQ8I3 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

15 Incyte EST cluster sequence (Incyte EST cluster sequence no. 45501 . The Incyte EST cluster sequence no. 
45501 sequence was then compared to a variety of expressed sequence tag (EST) databases which included public 
EST databases (e.g. , GenBank) and a proprietary EST DNA database (LIFESEQ ™, Incyte Pharmaceuticals, Palo 
Alto, CA) to identify existing homologies. The homology search was performed using the computer program 
BLAST or BLAST2 (Altshul et aL, Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting 

20 in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56400. 

In light of an observed sequence homology between the DNA56400 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. T90592, the Merck EST clone T90592 was purchased 

25 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 148 and is herein designated DNA57834-1339. 

The full length clone shown in Figure 148 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 109-111 and ending at the stop codon found at nucleotide 
positions 637-63? (Figure 149; SEQ ID NO:221). The predicted polypeptide precursor is 176 amino acids long, 

30 has a calculated molecular weight of approximately 19,616 daltons and an estimated pi of approximately 7.11. 
Analysis of the full-length PR0813 sequence shown in Figure 149 (SEQ ID NO:221) evidences the presence of 
the following: a signal peptide from about amino acid 1 to about amino acid 26 and potential N-myristoylation 
sites from about amino acid 48 to about amino acid 53, from about amino acid 153 to about amino acid 158, 
from about amino acid 156 to about amino acid 161 and from about amino acid 167 to about amino acid 172. 

35 Clone DNA57834-1339 has been deposited with the ATCC on June 9, 1998 and is assigned ATCC deposit no. 
209954. 



428 



BNSDOCID: <WO 



9963088A2J.. 



WO 99/63088 



PCT/US99/12252 



Analysis of the amino acid sequence of the full-length PR0813 polypeptide suggests that it possesses 
sequence similarity to the pulmonary surfactant-associated protein C. More specifically, an analysis of the 
Dayhoff database (version 35.45 SwissProt 35) evidenced some degree of homology between the PR0813 amino 
acid sequence and the following Dayhoff sequences ,PSPC_M US VI, PP92071, G02964, PR65489, P_P82977, 
P_R84555, S55542, MUSIGHAJJ and PH11j6. 

5 

EXAMPLE 64 : Isolation of cDNA Clones Encoding Human PRQ8Q9 

Use of the signal sequence algorithm described in Exam|. le 3 above allowed identification of a single 
Incyte EST cluster sequence. The Incyte EST cluster sequence ^'as then compared to a variety of expressed 
sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary EST DNA 

10 database (LIFESEQ™, Incyte Pharmaceuticals, ilo Alto, CA) to identify existing homologies. The homology 
search was performed using the computer program BLAST or BLAST2 ( Altshul et al. , Methods in Enzvmology 
266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater 
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

15 obtained therefrom is herein designated DNA56418. 

In light of an observed sequence homology between the DNA56418 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. H74302, the Merck EST clone H74302 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 150 and is herein designated DNA57836-1338. 

20 The entire nucleotide sequence of DNA57836-I338 is shown in Figure 150 (SEQ ID NO:222). Clone 

DNA57836-1338 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 63-65 and ending at the stop codon at nucleotide positions 858-860 of SEQ ID NO: 222 (Figure 150). 
The predicted polypeptide precursor is 265 amino acids long (Figure 151). The full-length PRO809 protein 
shown in Figure 151 has an estimated molecular weight of about 29,061 daltons and a pi of about 9. 18. Figure 

25 151 further shows the approximate positions of the signal peptide and N-glysosylation sites. The corresponding 
nucleotides can be determined by referencing Figure 150. Clone DNA57836-1338 has been deposited with 
ATCC on June 23, 1998. It is understood that the deposited clone has the actual nucleic acid sequence and that 
the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PRO809 polypeptide suggests that it possesses 

30 some sequence similarity to the heparin sulfate proteoglycan and to endothelial cell adhesion molecule- 1 . More 
specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced sequence identity 
between the PRO809 amino acid sequence and the following Dayhoff sequences, PGBMJMOUSE, D82082 1 
and PW14158. 

35 EXAMPLE 65 : Isolation of cDNA Clones Encoding Human PRQ791 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence. The Incyte EST cluster sequence was then compared to a variety of expressed 

429 



<WO_ 9963088A2J.> 



WO 99/63088 



PCT/US99/12252 



I 



sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary EST DNA 
database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology 
search was performed using the computer program BLAST or BLAST2 (Altshul et al. , Methods in Enzvmologv 
266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater 
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
5 program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56429. 

In light of an observed sequence homology between the DNA56429 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 36367, the Merck EST clone 36367 was purchased and 
the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
10 sequence of this cDNA insert is shown in Figure IL~ and is herein designated DNA57838-1337. 

The entire nucleotide sequence of DNA57838-1337 is shown in Figure 152 (SEQ ID NO:224). Clone 
DNA57838-1337 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 9-11 and ending at the stop codon at nucleotide positions 747-749 of SEQ ID NO:224 (Figure 152). 
The predicted polypeptide precursor is 246 amino acids long (Figure 153). The full-length PR0791 protein 
15 shown in Figure 153 has an estimated molecular weight of about 27,368 daltons and a pi of about 7.45. Figure 
153 also shows the approximate locations of the signal peptide, the transmembrane domain, N-glycosylation 
sites and a region conserved in extracellular proteins. The corresponding nucleotides of one embodiment 
provided herein can be identified by referencing Figure 152. Clone DNA57838-1337 has been deposited with 
ATCC on June 23, 1998. It is understood that the deposited clone has the actual nucleic acid sequence and that 
20 the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PR0791 polypeptide suggests that it has sequence 
similarity with MHC-I antigens, thereby indicating that PR0791 may be related to MHC-I antigens. More 
specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced some sequenc identity 
between the PR0791 amino acid sequence and the following Dayhoff sequences, AF034346_1 , MMQ1K51 and 
25 HFE HUMAN. 



EXAMPLE 66 : Isolation of cDNA clones Encoding Human PRO 1004 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence, Incyte cluster sequence No. 73681 . This EST cluster sequence was then compared 

30 to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
to identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, Univ. of Washington, Seattle, 

35 Washington). The consensus sequence obtained therefrom is herein designated as DNA56516. 

In light of an observed sequence homology between the DNA56516 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. H43837, the Merck EST clone H43837 was purchased 
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and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 154. 

The full length clone shown in Figure 154 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 1 19-121 and ending at the stop codon at nucleotide positions 
464-466 (Figure 154; SEQ ID NO;226). The predicted polypeptide precursor is 115 amino acids long (Figure 
5 155; SEQ ID NO:227). The full-length PRO1004 protein shown in Figure 155 has an estimated molecular 
weight of about 13,649 daltons and a pi of about 9.58. Analysis of the full-length PRO 1004 sequence shown 
in Figure 155 (SEQ ID NO: 227) evidences the presence of the following features: a signal peptide at about amino 
acids 1-24, a microbodies C -terminal targeting signal at about amino acids 1 13-1 15, a potential N-glycosylation 
site at about amino acids 71-74, and a domain having sequence identity with dihydrofolate reductase proteins at 
10 about amino acids 22-48. 

Analysis of the amino acid sequence of the full-length PRO 1004 polypeptide using the Dayhoff database 
(version 35.45 SwissProt 35) evidenced homology between the PRO 1004 amino acid sequence and the following 
Dayhoff sequences: CELR02D3J7, LECI_MOUSE, AF006691_3, SSZ97390J, SSZ97395 1, and 
SSZ97400J. 

15 Clone DNA57844-1410 was deposited with the ATCC on June 23, 1998, and is assigned ATCC deposit 

no. 203010. 

EXAMPLE 67 : Isolation of cDNA clones Encoding Human PROl 1 1 1 

An expressed sequence tag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, C A) 

20 was searched and an EST was identified which had homology to insulin- like growth factor binding protein. 

RN A for construction of cDNA libraries was isolated from human fetal brain. The cDNA libraries used 
to isolate the cDN A clones encoding human PRO 1111 were constructed by standard methods using commercially 
available reagents such as those from Invitrogen, San Diego, CA. The cDNA was printed with oligo dT 
containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately 

25 by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or 
pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et aL, Science , 
253:1278-1280 (1991)) in the unique Xhol and NotI. 

The human fetal brain cDNA libraries (prepared as described above), were screened by hybridization 
with a synthetic oligonucleotide probe based upon the Incyte EST sequence described above: 

30 5 1 -CC ACC ACCTGGAGGTCCTGC AGTTGGGC AGG A ACTCC ATCCGGC AG ATTG-3 ' (SEQ ID NO:251). 

An identified cDNA clone was sequenced in entirety. The entire nucleotide sequence of PROl 1 1 1 is 
shown in Figure 156 (SEQ ID NO:228). Clone DNA58721-1475 contains a single open reading frame with an 
apparent translational initiation site at nucleotide positions 57-59 and a stop codon at nucleotide positions 2016- 
2018 (Figure 156; SEQ ID NO:228). The predicted polypeptide precursor is 653 amino acids long (Figure 157). 

35 The transmembrane domains are at positions 21-40 (type II) and 528-548. Clone DNA58721-1475 has been 
deposited with ATCC and is assigned ATCC deposit no. 2031 10. The full-length PROl 1 1 1 protein shown in 
Figure 157 has an estimated molecular weight of about 72,717 daltons and a pi of about 6.99. 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 157 (SEQ ID NO:229), revealed some sequence 
identity between the PROl 1 1 1 amino acid sequence and the following Dayhoff sequences: A58532, D86983 1 , 
RNPLGPVJ, PGS2 HUMAN, AF038127J, ALS_MOUSE, GPV HUMAN, PGS2_BOVIN, ALS_PAPPA 
and 147020- 

5 

EXAMPLE 68 : Isolation of cDNA clones Encoding Human PRO 1344 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA33790. Based on the DNA33790 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PC R a cDNA library that contained 
10 the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PRO 1344. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 ' - AGGTTCGTG ATGGAGAC A ACCGCG-3 ' (SEQ ID NO:232) 
reverse PCR primer 5 ' -TGTC A AGG ACGC ACTGCCGTC ATG -3 ' (SEQ ID NO:233) 
15 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA33790 
sequence which had the following nucleotide sequence 
hybridization probe 

S'-TGGCCAGATCATCAAGCGTGTCTGTGGCAACGAGCGGCCAGCTCCTATCC^' (SEQ ID NO:234) 
In order to screen several libraries for a source of a full- length clone, DNA from the libraries was 

20 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO 1344 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR01344 (designated herein as DNA58723-1588 [Figure 158, SEQ ID NO:230]); and the derived protein 

25 sequence for PRO 1 344 . 

The entire nucleotide sequence of DNA58723-1588 is shown in Figure 158 (SEQ ID NO:230). Clone 
DNA58723-1588 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 26-28 and ending at the stop codon at nucleotide positions 2186-2188 (Figure 158). The predicted 
polypeptide precursor is 720 amino acids long (Figure 159). The full-length PRO 1344 protein shown in Figure 

30 159 has an estimated molecular weight of about 80, 199 daltons and a pi of about 7.77. Analysis of the full- 
length PR01344 sequence shown in Figure 159 (SEQ ID NO:231) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 23, an EGF-like domain cysteine protein signature 
sequence from about amino acid 260 to about amino acid 271 , potential N-glycosylation sites from about amino 
acid 96 to about amino acid 99, from about amino acid 279 to about amino acid 282, from about amino acid 316 

35 to about amino acid 319, from about amino acid 451 to about amino acid 454 and from about amino acid 614 
to about amino acid 617, an amino acid sequence block having homology to serine proteases, trypsin family from 
about amino acid 489 to about amino acid 505 and a CUB domain protein profile sequence from about amino 
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acid 150 to about amino acid 166. Clone DNA58723-1588 has been deposited with ATCC on August 18, 1998 
and is assigned ATCC deposit no. 203133. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 159 (SEQ ID NO:231), evidenced significant 
homology between the PRO 1344 amino acid sequence and the following Dayhoff sequences: S77063_l, 
5 CRARJvlOUSE, PJR74775, PJ>90070, P_R09217, P_P70475, HSBMP16_1 and U50330 1. 

EXAMPLE 69 : Isolation of cDNA clones Encoding Human PROI109 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DN A52642. The consensus DNA sequence 

1 0 was obtained by extending using repeated cycles of BLAST and phrap a previously obtained consensus sequence 
as far as possible using the sources of EST sequences discussed above. Based on the DNA52642 consensus 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 
of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO 1109. 
PCR primers (forward and reverse) were synthesized: 

15 forward PCR primer 5 * -CCTTACCTCAGAGGCCAGAGC AAGC-3 ' (SEQ ID NO:237) 
reverse PCR primer 5 ' -G AGCTTC ATCCGTTCTGCGTTC ACC -3 ' (SEQ ID NO:238) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA52642 
sequence which had the following nucleotide sequence 
hybridization probe 

20 5'-CAGGAATGTAAAGCTTTACAGAGGGTCGCCATCCTCGTTCCCCACC-3' (SEQ ID NO:239) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROl 109 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human SK-Lu-i adenocarcinoma cell tissue (LIB247). 

25 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PROl 109 (designated herein as DNA58737-1473 [Figure 160, SEQ ID NO:235]) and the derived protein 
sequence for PROl 109. 

The entire nucleotide sequence of DNA. £r 8737-1473 is shown in Figure 160 (SEQ ID NO:235). Clone 
DNA58737-1473 contains a single open reading frame with an apparent translational initiation site at nucleotide 

30 positions 1 19-120 and ending at the stop codon at nucleotide positions 1 15 1- 1 153 (Figure 160). The predicted 
polypeptide precursor is 344 amino acids long (Figure 161). The full-length PROl 109 protein shown in Figure 
161 has an estimated molecular weight of about 40,041 daltons and a pi of about 9.34. Analysis of the full- 
length PROl 109 sequence shown in Figure 161 (SEQ ID NO:236) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 27, potential N-glycosylation sites from about amino 

35 acid 4 to about amino acid 7, from about amino acid 220 to about amino acid 223 and from about amino acid 
335 to about amino acid 338 and an amino acid sequence block having homology to xylose isomerase proteins 
from about amino acid 191 to about amino acid 201. Clone DNA58737-1473 has been deposited with ATCC 
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on August 18, 1998 and is assigned ATCC deposit no. 203136. 

An analysis of the Dayhoff database (version 35.45 Swiss Prot 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 161 (SEQ ID NO:236), evidenced significant 
homology between the PROl 109 amino acid sequence and the following Dayhoff sequences: HSUDPGAL 1, 
HSUDPB141, NALSJBOVIN, HSU10473J, CEW02B12 11, YNJ4CAEEL, AE000738J 1, CET24D1 1, 
5 S48121 and CEGLY9J. 

EXAMPLE 70 : Isolation of cDNA clones Encoding Human PRO 1383 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA5396L Based on the DNA53961 
10 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PRO 1383. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5'-CATTTCCTTACCCTGGACCCAGCTCC 3' (SEQ ID NO:242) 
15 reverse PCR primer 5'~GAAAGGCCCACAGCACATCTGGCAG-3' (SEQ ID NO:243) 

Additionally , a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA53961 
sequence which had the following nucleotide sequence 
hybridization probe 

5*~CCACGACCCGAGCAACTTCCTCAAGACCGACTTGTTTCTCTACAGC-3' (SEQ ID NO:244) 
20 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO 1383 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal brain tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
25 PR01383 (designated herein as DNA58743-1609 [Figure 162, SEQ ID NO: 240]) and the derived protein 
sequence for PRO 1383. 

The entire nucleotide sequence of DNA58743-1609 is shown in Figure 162 (SEQ ID NO:240). Clone 
DNA58743-1609 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 122-124 and ending at the stop codon at nucleotide positions 1391-1393 (Figure 162). The predicted 

30 polypeptide precursor is 423 amino acids long (Figure 163). The full-length PR01383 protein shown in Figure 
163 has an estimated molecular weight of about 46,989 daltons and a pi of about 6.77. Analysis of the full- 
length PR01383 sequence shown in Figure 163 (SEQ ID NO:241) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 24, a transmembrane domain from about amino acid 
339 to about amino acid 362, and potential N-glycosylation sites from about amino acid 34 to about amino acid 

35 37, from about amino acid 58 to about amino acid 61 , from about amino acid 142 to about amino acid 145, from 
about amino acid 197 to about amino acid 200, from about amino acid 300 to about amino acid 303 and from 
about amino acid 364 to about amino acid 367. Clone DNA58743-1609 has been deposited with ATCC on 
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August 25, 1998 and is assigned ATCC deposit no. 203154. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 163 (SEQ ID NO:241), evidenced significant 
homology between the PRO 1383 amino acid sequence and the following Dayhoff sequences: NMBHUMAN, 
QNR_COTJA, P_W38335, P115CHICK, P_W38164, A45993J, MMU70209J, D83704J and P_W39176. 

5 

EXAMPLE 71 : Isolation of cPNA Clones Encoding Human PRO 1003 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence designated herein as 43055. This sequence was then compared to a variety of EST 
databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database 

10 (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA, ^o identify existing homologies. The homology search 
was performed using the computer program BLAST or BLAST2 (Altshul et aL, Methods in Enzvmology 
266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater 
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

15 obtained therefrom is herein designated consenOl . 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Incyte EST clone no. 2849382, the Incyte EST clone 2849382 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 164. 

20 The entire nucleotide sequence of DNA5 8846- 1409 is shown in Figure 164 (SEQ ID NO:245). Clone 

DNA58846-1409 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 41-43 and ending at the stop codon at nucleotide positions 293-295 (Figure 164). The predicted 
polypeptide precursor is 84 amino acids long (Figure 165). The full-length PRO1003 protein shown in Figure 
165 has an estimated molecular weight of about 9,408 daltons and a pi of about 9.28. Analysis of the full-length 

25 PRO 1003 sequence shown in Figure 165 (SEQ ID NO:246) evidences the presence of a signal peptide at amino 
acids 1 to about 24, and a cAMP- and cGMP-dependent protein kinase phosphorylation site at about amino acids 
58 to about 61 . Analysis of the amino acid sequence of the full-length PRO 1003 polypeptide using the Dayhoff 
database (version 35.45 SwissProt 35) evidenced homology between the PRO 1003 amino acid sequence and the 
following Dayhoff sequences: AOPCZA363 3, SRTX ATREN, A48298, MHVJHMS 1, VGL2_CVMJH, 

30 DHDHTC2 2, CORTRAT, TAL6_HUMAN, P_W14123, and DVUFI 2. 

EXAMPLE 72 : Isolation of cDNA Clones Encoding Human PRO 11 08 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA53237. 
35 In light of an observed sequence homology between the DNA53237 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 2379881, the Incyte EST clone 2379881 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
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The sequence of this cDNA insert is shown in Figure 166 and is herein designated DNA58848-1472. 

The entire nucleotide sequence of DNA58848-1472 is shown in Figure 166 (SEQ ID NO:247). Clone 
DNA58848-1472 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 77-79 and ending at the stop codon at nucleotide positions 1445-1447 (Figure 166). The predicted 
polypeptide precursor is 456 amino acids long (Figure 167). The full-length PROl 108 protein shown in Figure 
5 167 has an estimated molecular weight of about 52,071 daltons and a pi of about 9,46. Analysis of the full- 
length PROl 108 sequence shown in Figure 167 (SEQ ID NO:248) evidences the presence of the fo!lowing:type 
II transmembrane domains from about amino acid 22 to about amino acid 42, from about amino acid 156 to 
about amino acid 176, from about amino acid 180 to about amino acid 199 and from about amino acid 369 to 
about amino acid 388, potential N-glycosylaion sites from about amino acid 247 to about amino acid 250, from 

10 about a ino acid 327 to about amino acid 330, from about amino acid 328 to about amino acid 331 and from 
about amino acid 362 to about amino acid 365 and an amino acid block having homology to ER lumen protein 
retaining receptor protein from about amino acid 153 to about amino acid 190. Clone DNA58848-1472 has been 
deposited with ATCC on June 9, 1998 and is assigned ATCC deposit no. 209955. 

Analysis of the amino acid sequence of the full-length PROl 108 polypeptide suggests that it possesses 

15 significant sequence similarity to the LPAAT protein, thereby indicating that PROl 108 may be a novel LPAAT 
homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
significant homology between the PROl 108 amino acid sequence and the following Dayhoff sequences, 
AF015811J, CER07E3_2, YL35 CAEEL, S73863, CEF59F4_4, P_W06422, MMU41736_1, MTV008_39, 
PR99248 and Y67 BPT7 . 

20 

EXAMPLE 73 : Isolation of cDNA Clones Encoding Human PROl 137 

The extracellular domain (ECD) sequences (including the secretion signal, if any) of from about 950 

known secreted proteins from the Swiss-Prot public protein database were used to search expressed sequence 

tag (EST) databases. The EST databases included public EST databases (e.g., GenBank) and a proprietary EST 
25 DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA). The search was performed using the 

computer program BLAST or BLAST2 (Altshul et ah, Methods in Enzvrnology 266:460-480 (1996)) as a 

comparison of the ECD protein sequences to a 6 frame translation of the EST sequence. Using this procedure, 

Incyte EST No. 3459449, also referred to herein as "DNA7108", was identified as an EST having a BLAST 

score of 70 or greater that did not encode a known protein. 
30 A consensus DNA sequence was assembled relative to the DNA7108 sequence and other ESTs using 

repeated cycles of BLAST and the program "phrap" (Phil Green, Univ. of Washington, Seattle, WA). The 

consensus sequence obtained therefrom is referred to herein as DNA53952. 

In light of an observed sequence homology between the DNA53952 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3663102, the Incyte EST clone 3663102 was purchased 
35 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

The sequence of this cDNA insert is shown in Figure 168. 
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The entire nucleotide sequence of DNA58849-1494 is shown in Figure 168 (SEQ ID NO:249). Clone 
DNA58849-1494 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 77-79 and ending at the stop codon at nucleotide positions 797-799 (Figure 168). The predicted 
polypeptide precursor is 240 amino acids long (Figure 169). The full-length PRO 11 37 protein shown in Figure 
169 has an estimated molecular weight of about 26,064 daltons and a pi of about 8.65. Analysis of the full- 
5 length PROH37 sequence shown in Figure 169 (SEQ ID NO:250) evidences the presence of a signal peptide 
at about amino acids 1 to 14 and a potential N-glycosylation site at about amino acids 101-105. 

Analysis of the amino acid sequence of the full-length PROl 137 polypeptide suggests that it possesses 
significant sequence similarity to ribosyltransferase thereby indicating that PRO 11 37 may be a novel member 
of the ribosyltransferase family of proteins. Analysis of the amino acid sequence of the full-length PROl 137 
10 polypeptide using the Dayhoff database (version 35.45 SwissProt 35) evidenced homology between the PRO 1 137 
amino acid sequence and the following Dayhoff sequences: MMART51, NARG MOUSE, GEN11909, 
GEN 13794, GEN 14406, MMRNART62 1, and P_R41876. 



EXAMPLE 74 : Isolation of cDNA clones Encodin2 Human PROl 138 

15 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

Incyte EST sequence, Incyte cluster sequence no. 1652 12. This cluster sequence was then compared to a variety 
of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a 
proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 

20 et al. , Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated as DNA54224. The assembly included a proprietary 
Genentech EST designated herein as DNA49140 (Figure 172; SEQ ID NO:254). 

25 In light of an observed sequence homology between the DNA54224 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3836613, the Incyte EST clone 3836613 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 170 and is the full-length DNA sequence for PROl 138. 
Clone DNA58850-1495 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit no. 

30 209956. 

The entire nucleotide sequence of DNA58850-1495 is shown in Figure 170 (SEQ ID NO:252). Clone 
DNA58850-1495 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 38-40 and ending at the stop codon at nucleotide positions 1043-1045 (Figure 170), The predicted 
polypeptide precursor is 335 amino acids long (Figure 171). The full-length PROl 138 protein shown in Figure 
35 171 has an estimated molecular weight of about 37,421 Daltons and a pi of about 6.36. Analysis of the full- 
length PROl 138 sequence shown in Figure 171 (SEQ ID NO:253) evidences the presence of the following 
features: a signal peptide at about amino acid 1 to about amino acid 22; a transmembrane domain at about amino 
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acids 224 to about 250; a leucine zipper pattern at about amino acids 229 to about 250; and potential N- 
glycosylation sites at about amino acids 98- 101, 142-145, 148-151, 172-175, 176-179, 204-207, and 291-295. 

Analysis of the amino acid sequence of the full-length PROl 138 polypeptide suggests that it possesses 
significant sequence similarity to the CD84, thereby indicating that PROl 138 may be a novel member of the Ig 
superfamily of polypeptides. More particularly, analysis of the amino acid sequence of the full-length PROl 138 
5 polypeptide using the Dayhoff database (version 35 .45 SwissProt 35) evidenced homology between the PRO 1138 
amino acid sequence and the following Dayhoff sequences: HSU82988_1 , HUMLY91 , PR9763 1 , PR97628, 
P_R97629, P_R97630, CD48 RAT, CD2_HUMAN, P_P93996, and HUMBGPJ. 

Clone DNA58850-1495 was deposited with ATCC on June 9, 1998, and is assigned ATCC deposit no. 

209956. 

10 

EXAMPLE 75 : Isolation of cDNA clones Encoding Human PRO 1 054 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 66212. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 

15 and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 

20 consensus sequence obtained therefrom is herein designated DNA55722. 

In light of an observed sequence homology between the DNA55722 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 31975 1, the Incyte EST clone 319751 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 173 and is herein designated as DNA58853-1423. 

25 Clone DNA58853-1423 contains a single open reading frame with an apparent translational initiation 

site at nucleotide positions 46-48 and ending at the stop codon at nucleotide positions 586-588 (Figure 173). The 
predicted polypeptide precursor is 180 amino acids long (Figure 174). The full-length PRO1054 protein shown 
in Figure 174 has an estimated molecular weight of about 20,638 daltons and a pi of about 5.0. Analysis of the 
full-length PRO1054 sequence shown in Figure 174 (SEQ ID NO:256) evidences the presence of the following; 

30 a signal peptide from about amino acid i to about amino acid 18, a leucine zipper pattern from about amino acid 
155 to about amino acid 176 and amino acid sequence blocks having homology to lipocalin proteins from about 
amino acid 27 to about amino acid 38 and from about amino acid 110 to about amino acid 120. Clone 
DNA58853-1423 has been deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203016. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

35 alignment analysis of the full-length sequence shown in Figure 174 (SEQ ID NO:256), evidenced significant 
homology between the PRO1054 amino acid sequence and the following Dayhoff sequences: MUP1 MOUSE, 
MUP6_MOUSE, MUP2MOUSE, MUP8 MOUSE, MUP5 MOUSE, MUP4JvfOUSE, S10124, 
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MUPM_MOUSE, MUPRAT and ECU70823J. 

EXAMPLE 76 : Isolation of cDN A clones Encoding Human PRQ994 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 157555. This EST cluster sequence was then compared 
5 to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al. , Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 

10 DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA55728. 

In light of an observed sequence homology between the DNA55728 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2860366, the Incyte EST clone 2860366 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

15 The sequence of this cDNA insert is shown in Figure 175 and is herein designated as DNA58855-1422. 

Clone DNA58855-1422 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 3 1 -33 and ending at the stop codon at nucleotide positions 7 18-720 (Figure 175). The 
predicted polypeptide precursor is 229 amino acids long (Figure 176). The full-length PR0994 protein shown 
in Figure 176 has an estimated molecular weight of about 25, 109 daitons and a pi of about 6.83. Analysis of 

20 the full-length PR0994 sequence shown in Figure 176 (SEQ ID NO; 258) evidences the presence of the 
following: transmembrane domains from about amino acid 10 to about amino acid 31, from about amino acid 
50 to about amino acid 72, from about amino acid 87 to about amino acid 1 10 and from about amino acid 191 
to about amino acid 213, potential N-glycosylation sites from about amino acid 80 to about amino acid 83, from 
about amino acid 132 to about amino acid 135, from about amino acid 148 to about amino acid 151 and from 

25 about amino acid 163 to about amino acid 166 and an amino acid block having homology to TNFR/NGFR 
cysteine-rich region proteins from about amino acid 4 to about amino acid 1 1 . Clone DNA58855-1422 has been 
deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203018. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of thw full-length sequence shown in Figure 176 (SEQ ID NO: 25 8), evidenced significant 

30 homology between the PR0994 amino acid sequence and the following Dayhoff sequences: AF027204 1, 
TAL6 HUMAN, ILT4 HUMAN, JC6205, MMU57570 1, S40363, ETU56093J, S42858, P_R66849 and 
P_R74751. 

EXAMPLE 77 : Isolation of cDNA clones Encoding Human PRQ812 
35 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte database, designated 170079. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
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and a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al. , Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phn ^.cen, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated as DNA55721. 

In light of an observed sequence homology between the DNA55721 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 388964, ti.a Incyte EST clone 388964 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 177 and is herein designated as DNA59205-1421. 

Clone DNA59205-1421 contains a sing' open reading frame with an apparent translational initiation 
site at nucleotide positions 55-57 and ending at the stop codon at nucleotide positions 304-306 (Figure 177). The 
predicted polypeptide precursor is 83 amino acids long (Figure 178). The full-length PR0812 protein shown 
in Figure 178 has an estimated molecular weight of about 9,201 daltons and a pi of about 9.3. Analysis of the 
full-length PR0812 sequence shown in Figure 178 (SEQ ID NO: 260) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 15, a cAMP- and cGMP-dependent protein kinase 
phosphorylation site from about amino acid 73 to about amino acid 76 and protein kinase C phosphorylation sites 
from about amino acid 70 to about amino acid 72 and from about amino acid 76 to about amino acid 78. Clone 
DNA59205-1421 has been deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203009. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 178 (SEQ ID NO:260), evidenced significant 
homology between the PROS 12 amino acid sequence and the following Dayhoff sequences: P_W35802, 
P W35803, PSC1 RAT, S68231, GEN13917, PSC2JRAT, CC 1 OJWM A N,UTER_R ABIT, A F008595J and 
A56413. 

EXAMPLE 78 : Isolation of cDNA clones Encoding Human PRO 1069 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST sequence designated herein as 100727. This sequence was then compared to a proprietary EST 
DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, Univ. of Washington, Seattle, Washington). The consensus sequence obtained 
therefrom is herein designated DNA56001. 

In light of an observed sequence homology between the DNA56001 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3533881, the Incyte EST clone 3533881 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 179 and is the full-length DNA sequence for PRO1069. 
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Clone DNA5921 1-1450 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit no. 
209960. 

The entire nucleotide sequence of DNA5921 1-1450 is shown in Figure 179 (SEQ ID NO:261). Clone 
DNA5921 1-1450 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 197-199 and ending at the stop codon at nucleotide positions 464-466, The predicted polypeptide 
5 precursor is 89 amino acids long (Figure 180). The full-length PRO 1069 protein shown in Figure 180 has an 
estimated molecular weight of about 9,433 daltons and a pi of about 8.21 . Analysis of the full-length PRO 1069 
sequence shown in Figure 180 (SEQ ID NO: 262) evidences the presence of the following features: a signal 
peptide sequence at amino acid 1 to about 16; a transmembrane domain at about amino acids 36 to about 59; 
potential N-myristoylation sites at about amino acids 41-46, 45-50, and 84-89; and homology with extracellular 
10 protein^ JCP/Tpx-l/Ag5/PR-i/Sc7 at about amino acids 54 to about 66. 

Analysis of the amino acid sequence of th? full-length PRO 1069 polypeptide suggests that it possesses 
significant sequence similarity to CHIF, thereby indicating that PRO 1069 may be a member of the CHIF family 
of polypeptides. More particularly, analysis of the amino acid sequence of the full-length PRO1069 polypeptide 
using the Dayhoff database (version 35.45 SwissProt 35) evidenced homology between the PRO 1069 amino acid 
15 sequence and the following Dayhoff sequences: CHIF RAT, A55571, PLMJIUMAN, A40533, 
ATNG BOVIN, RIC_MOUSE, PETDJSYNY3, VTB1_XENLA, A05009, and S75086. 

Clone DNA5921 1-1450 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit 
no. 209960. 

20 EXAMPLE 79 : Isolation of cDNA Clones Encoding Human PROl 129 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence designated herein as 98833. The Incyte EST cluster sequence no. 98833 sequence 
was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) 

25 to identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in Enzvmoiogy 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56038. 

30 In light of an observed sequence homology between the DNA56038 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no, 1335241 , the Incyte EST clone 1335241 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 181 and is herein designated DNA59213-1487. 

The full length clone shown in Figure 181 contained a single open reading frame with an apparent 

35 translational initiation site at nucleotide positions 42-44 and ending at the stop codon found at nucleotide positions 
1614-1616 (Figure 181; SEQ ID NO:263). The predicted polypeptide precursor is 524 amino acids long, has 
a calculated molecular weight of approximately 60,310 daltons and an estimated pi of approximately 7.46. 
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Analysis of the full-length PROH29 sequence shown in Figure 182 (SEQ ID NO:264) evidences the presence 
of the following: type II transmembrane domains from about amino acid 13 to about amino acid 32 and from 
about amino acid 77 to about amino acid 102, a cytochrome P-450 cysteine heme-iron ligand signature sequence 
from about amino acid 461 to about amino acid 470 and potential N-glycosylation sites from about amino acid 
1 12 to about amino acid 1 15 and from about amino acid 168 to about amino acid 171. Clone DNA59213-1487 
has been deposited with the ATCC on June 9, 1998 and is assigned ATCC deposit no. 209959. 

Analysis of the amino acid sequence of the full-length PRO 1 129 polypeptide suggests that it possesses 
sequence similarity to the cytochrome P-450 family of proteins. More specifically, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced some degree of homology between the PROl 129 amino acid 
sequence and the following Dayhoff sequences, AC0G4523_t, S45702, AF054821_1 and 153015. 



10 



EXAMPLE 80 : Isolation of cDNA clones Encoding Human PRO1068 
. Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the LIFESEQ® database, designated Incyte cluster no. 141736. This EST cluster sequence 

was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
15 (e.g., GenBank) and a proprietary EST DNA database (HFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to 

identify existing homologies. One or more of the ESTs was derived from a human mast cell line from a patient 
^ with mast cell leukemia. The homology search was performed using the computer program BLAST or BLAST2 

(Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score 

of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
20 consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 

Washington). The consensus sequence obtained therefrom is herein designated DNA56094. 

In light of an observed sequence homology between the DNA56094 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 004974, the Incyte EST clone 004974 was purchased 

and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
25 The sequence of this cDNA insert is shown in Figure 183 and is herein designated as DNA59214-1449 (SEQ 

ID NO:265). 

The full length clone shown in Figure 183 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 42-44 and ending at the stop codon found at nucleotide positions 
^ 414-416 (Figure 183; SEQ ID NO:265). The predicted polypeptide precursor (Figure 184, SEQ ID NO:266) 

30 is 124 amino acids long. PRO1068 has a calculated molecular weight of approximately 14,284 daltons and an 
estimated pi of approximately 8.14. The PRO1068 polypeptide has the following additional features, as 
- indicated in Figure 184: a signal peptide sequence at about amino acids 1-20, a urotensin II signature sequence 

at about amino acids 118-123, a cell attachment sequence at about amino acids 64-66, and a potential cAMP- 
and cGMP-dependent protein kinase phosphorylation site at about amino acids 112-1 15. 
35 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 184 (SEQ ID NO: 266), revealed homology 
^ between the PRO1068 amino acid sequence and the following Dayhoff sequences: HALBOP1, MTV043 36, 
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150498, and PR78445 

Clone DNA59214-1449 was deposited with the ATCC on July I, 1998 and is assigned ATCC deposit 
no.203046. 

EXAMPLE 81 : Isolation of cDNA clones Encoding Human PRO 1066 
5 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

Incyte EST cluster sequence designated herein as 79066. The Incyte EST cluster sequence no. 79066 sequence 
was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) 
to identify existing homologies. The homology search was performed using the computer program BLAST or 

10 BLAST2 (Aitshul et ah, Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56121. 

In light of an observed sequence homology between the DNA56121 consensus sequence and an EST 

1 5 sequence encompassed within the Incyte EST clone no . 1515315, the Incyte EST clone 1515315 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 185 and is herein designated DNA59215-1425. 

The full length clone shown in Figure 185 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 176-178 and ending at the stop codon found at nucleotide 

20 positions 527-529 (Figure 185; SEQ ID NO:267). The predicted polypeptide precursor is 1 17 amino acids long, 
has a calculated molecular weight of approximately 12,91 1 daltons and an estimated pi of approximately 5.46. 
Analysis of the full-length PRO1066 sequence shown in Figure 186 (SEQ ID NO:268) evidences the presence 
of the following: a signal peptide from about amino acid 1 to about amino acid 23, a cAMP- and cGMP- 
dependent protein kinase phosphorylation site from about amino acid 38 to about amino acid 41 and potential 

25 N-myristoylation sites from about amino acid 5 to about amino acid 10, from about amino acid 63 to about amino 
acid 68 and from about amino acid 83 to about amino acid 88. Clone UNQ524 (DNA592 15-1425) has been 
deposited with the ATCC on June 9, 1998 and is assigned ATCC deposit no. 209961. 

Analysis of the amino acid sequence of the full-length PRO 1066 polypeptide suggests that it does not 
possess significant sequence similarity to any known human protein. However, an analyrs of the Dayhoff 

30 database (version 35.45 SwissProt 35) evidenced some degree of homology between the PRO 1066 amino acid 
sequence and the following Dayhoff sequences, MOTI HUMAN, AF025667J, MTCY19H9 8 and 
RABIGKCHl. 

EXAMPLE 82 : Isolation of cDNA Clones Encoding Human PRO 1 184 
35 Use of the signal sequence algorithm described in Example 3 on ESTs from an Incyte database allowed 

identification a candidate sequence designated herein as DNA56375. This sequence was then compared to a 
variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and 
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a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in Enzvmolo£V 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
5 consensus sequence obtained therefrom is herein designated DNA56375. 

In light of an observed sequence homology between the DNA56375 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 1428374, the Incyte EST clone 1428374 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 187. 

10 The full length clone shown in Figure 187 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 106-108 and ending at the stop codon found at nucleotide 
positions 532-534 (Figure 187; SEQ ID NO:269). The predicted polypeptide precursor is 142 amino acids long, 
has a calculated molecular weight of approximately 15,690 daltons and an estimated pi of approximately 9.64. 
Analysis of the full-length PRO 1 184 sequence shown in Figure 188 (SEQ ID NO:270) evidences the presence 

15 of a signal peptide at about amino acids 1-38. Clone DNA59220-1514 has been deposited with the ATCC on 
June 9, 1998. It is understood that the deposited clone has the actual sequences and that representations are 
presented herein. 

Analysis of the amino acid sequence of the full-length PRO 1 184 polypeptide suggests that it possesses 
some sequence identity with a protein called TIM from Drosophila virilis, designated "DVTIMS021" in the 
20 Dayhoff data base, (version 35.45 SwissProt 35). Other 

Dayhoff database (version 35.45 SwissProt 35) sequences having some degree of sequence identity with 
PROH84 include: WIS1 SCHPO, F002186 1, ATAC00239124 and MSAIPRP 1. 



EXAMPLE 83 : Isolation of cDNA clones Encoding Human PRO 1360 

25 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

sequence from an Incyte database, designated DNA 10572. This EST sequence was then compared to a variety 
of expressed sequence tag (EST) databases which included public EST databases (e,g, , GenBank, Merck/Wash. 
U.) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify 
existing homologies. The homology search was performed using the computer program BLAST or BLAST2 

30 (Altshul et al., Methods in Enzvrnology 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA57314. 

In light of an observed sequence homology between the DN A573 14 consensus sequence and an EST 

35 sequence encompassed within the Merck EST clone no. AA406443, the Merck EST clone AA406443 was 
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 
protein. The sequence of this cDNA insert is shown in Figure 189 and is herein designated as DNA59488-1603. 
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The full length clone shown in Figure 189 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 54-56 and ending at the stop codon found at nucleotide positions 
909-91 1 (Figure 189; SEQ ID NO:271). The predicted polypeptide precursor (Figure 190, SEQ ID NO:272) 
is 285 amino acids long. PRO 1360 has a calculated molecular weight of approximately 31,433 daltons and an 
' estimated pi of approximately 7.32. Clone DNA^" <?S-1603 was deposited with the ATCC on August 25, 1998 

;^ 5 and is assigned ATCC deposit no. 203157. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
M alignment analysis of the full-length sequence shown in Figure 190 (SLQ ID NO:272), revealed sequence identity 

ll between the PRO 1360 amino acid sequence and the following Dayhoff sequences: UN51CAEEL, 

YD4BSCHPO, AF000634J, GFO ZYMMO, YEU SCHPO, D86566 1, ZMGFOJ, S76976, 
10 PPSA SYNY3, and CEF28B14. 

** EXAMPLE 84 : isolation of cDNA clones Encoding Human PRO 1029 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 18763. This EST cluster sequence was then compared 
15 to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
^ homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 

- et al„ Methods in Enzvmology 266:460-480 (1996^. Those comparisons resulting in a BLAST score of 70 (or 

in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
20 DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA57854. 

In light of an observed sequence homology between the DNA57854 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. T98880, the Merck EST clone T98880 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
25 The sequence of this cDNA insert is shown in Figure 191 and is herein designated as DNA59493-1420. 

Clone DNA59493-1420 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 39-41 and ending at the stop codon at nucleotide positions 297-299 (Figure 191). The 
predicted polypeptide precursor is 86 amino acids long (Figure 192). The full-length PRO 1029 protein shown 
^ in Figure 192 has an estimated molecular weight of about 9,548 daltons and a pi of about 8.52. Analysis of the 

30 full-length PRO 1029 sequence shown in Figure 192 (SEQ ID NO:274) evidences the presence of the following; 
a signal peptide from about amino acid 1 to about amino acid 19, an amino acid block having homology to 
bacterial rhodopsins retinal binding site protein from about amino acid 50 to about amino acid 6 1 , a prenyl group 
binding site from about amino acid 83 to about amino acid 86 and a potential N-glycosylation site from about 
amino acid 45 to about amino acid 48. Clone DNA59493- 1420 has been deposited with ATCC on July 1 , 1998 
35 and is assigned ATCC deposit no. 203050, 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
: $ alignment analysis of the full-length sequence shown in Figure 192 (SEQ ID NO:274), evidenced significant 
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homology between the PRO 1029 amino acid sequence and the following Dayhoff sequences: S66088, 
AF031815_J, MM4A6LJ, PSEIS52a-l, S17699 and PJt63635. 

EXAMPLE 85 : Isolation of cDNA clones Encoding Human PROl 139 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
5 cluster sequence from the Incyte database, designated 4461. This EST cluster sequence was then compared to 
a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and 
a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et ai.. Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 

10 in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA57312. 

The DNA57312 consensus sequence included a 172 nucleotides long public EST (T62095, 
Merck/University of Washington public database). This EST clone, identified herein as a putative protein coding 

1 5 sequence, was purchased from Merck, and sequenced to provide the coding sequence of PRO 1 1 39 (Figure 193). 
As noted before, the deduced amino acid sequence of DNA59497- 1496 shows a significant sequence identity with 
the deduced amino acid sequence of HSOBRGRP1 . The full-length protein (Figure 194) contains a putative 
signal peptide between amino acid residues 1 and about 28, and three putative transmembrane domains 
(approximate amino acid residues 33-52, 71-89, 98-120). 

20 

EXAMPLE 86 : Isolation of cDNA clones Encoding Human PRO 1309 

An expressed sequence tag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) 
was searched and an EST was identified which showed homology to SLIT. 

RNA for construction of cDNA libraries was isolated from human fetall brain tissue. The cDNA 
25 libraries used to isolate the cDN A clones encoding human PRO 1 309 were constructed by standard methods using 
commercially available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with 
oligo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized 
appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as 
pRKB or pRKD; pRII53 is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et aL, Science . 
30 253:1278-1280 (1991)) in the unique Xhol and NotI. 

The cDNA libraries (prepared as described above), were screened by hybridization with a synthetic 
oligonucleotide probe derived from the above described Incyte EST sequence: 

5 ' -TCCGTGC AGGGGG ACGCCTTTC AG AAACTGCGCCG AGTT A AGG A AC-3 ' (SEQ ID NO:279). 

A cDNA clone was isolated and sequenced in entirety. The entire nucleotide sequence of DNA59588- 
35 1571 is shown in Figure 195 (SEQ ID NO:277). Clone DNA59588-1571 contains a single open reading frame 
with an apparent translational initiation site at nucleotide positions 720-722 and a stop codon at nucleotide 
positions 2286-2288 (Figure 195; SEQ ID NO:277). The predicted polypeptide precursor is 522 amino acids 
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long. The signal peptide is approximately at 1-34 and the transmembrane domain is at approximately 428-450 
of SEQ ID NO:278. Clone DNA59588-I571 has been deposited with ATCC and is assigned ATCC deposit no. 
203106. The full-length PRO 1309 protein shown in Figure 196 has an estimated molecular weight of about 
58,614 daltons and a pi of about 7.42. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
5 alignment analysis of the full-length sequence shown in Figure 1 96 (SEQ ID NO:278), revealed sequence identity 
between the PRO1309 amino acid sequence and the following Dayhoff sequences: AB0078761 1 GPV MOUSE, 
ALS RAT, PJR85889, LUM CHICK, ABO 14462 J, PGS1 CANFA, CEM887, A58532 and GEN11209. 

EXAMPLE 87 : Isolation of cDNA Clones Encoding Human PRO 1028 

10 Use of the signal sequence algorithm described in Example 3 above allowed identification of a certain 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, C A) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et ah, Methods in 

15 Enzvmology 266:460-480 ( 1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA59603. 

In light of an observed sequence homology between the DNA59603 sequence and an EST sequence 

20 contained within Incyte EST clone no. 1497725, the Incyte EST clone no. 1497725 was purchased and the cDNA 
insert was obtained and sequenced. It was found that the insert encoded a full-length protein. The sequence of 
this cDNA insert is shown in Figure 197 and is herein designated as DNA59603-1419. 

The entire nucleotide sequence of DNA59603-1419 is shown in Figure 197 (SEQ ID NO:280). Clone 
DNA59603-1419 contains a single open reading frame with an apparent translational initiation site at nucleotide 

25 positions 21-23 and ending at the stop codon at nucleotide positions 612-614 (Figure 197). The predicted 
polypeptide precursor is 197 amino acids long (Figure 198). The full-length PRO1028 protein shown in Figure 
198 has an estimated molecular weight of about 20,832 daltons and a pi of about 8.74. Clone DNA59603-1419 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

30 Analyzing the amino acid sequence of SEQ ID NO:281, the putative signal peptide is at about amino 

acids 1-19 of SEQ ID NO:28L An N-glycosyiation site is at about amino acids 35-38 of SEQ ID NO:281. A 
C-type lectin domain is at about amino acids 108-1 17 of SEQ ID NO: 281 , indicating that PR0513 may be related 
to or be a lectin. The corresponding nucleotides of these amino acid sequences or others can be routinely 
determined given the sequences provided herein. 

35 
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EXAMPLE 88 : Isolation of cDNA Clones Encoding Human PRO 1027 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a certain 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
^ EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

^ 5 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 

Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
^ or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 

^ the program "phrap" (Phil Green, University of Washington, Seattle, Washington), The consensus sequence 

obtained therefrom is herein designated DNA56399. 
10 In light of an observed sequence homology between the DNA56399 sequence and an EST sequence 

contained within Incyte EST clone no. 937605, the Incyte EST clone no. 937605 was purchased and the cDNA 
^ insert was obtained and sequenced. It was found that the insert encoded a full-length protein. The sequence of 

this cDNA insert is shown in Figure 199 and is herein designated as DNA59605-1418. 

The entire nucleotide sequence of DNA59605-1418 is shown in Figure 199 (SEQ ID NO:282). Clone 
15 DNA59605-1418 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 31-33 and ending at the stop codon at nucleotide positions 262-264 (Figure 199). The predicted 
|g polypeptide precursor is 77 amino acids long (Figure 200). The full-length PRO1027 protein shown in Figure 

m 200 has an estimated molecular weight of about 8,772 daltons and a pi of about 9.62. Clone DNA59605-1418 

has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
20 the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ ID NO:283, the putative signal peptide is at about amino 
acids 1-33 of SEQ ID NO:283. The type II fibronectin collagen-binding domain begins at about amino acid 30 
of SEQ ID NO:283. The corresponding nucleotides for these amino acid sequences and others can be routinely 
determined given the sequences provided herein. PRO 1027 may be involved in tissue formation or repair. 
25 The following Dayhoff designations appear to have some sequence identity with PRO 1027: 

SFT2 YEAST; ATM3E9J2; A69826; YM16_MARPO; E64896; U60193 2; MTLRAJ205 1; MCU60315 70; 
SPAS SHIFL; and S54213. 



EXAMPLE 89 : Isolation of cDNA Clones Encouxng Human PRO 1107 

30 Use of the signal sequence algorithm described in Example 3 above allowed identification of a certain 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 

35 EnzymolosY 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
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obtained therefrom is herein designated DNA56402. 

In light of an observed sequence homology between the DNA56402 sequence and an EST sequence 
contained within Incyte EST clone no. 3203694, the Incyte EST clone no. 3203694 was purchased and the cDNA 
insert was obtained and sequenced. It was found that the insert encoded a full-length protein. The sequence of 
this cDNA insert is shown in Figure 201 and is herein designated as DNA59606-1471. 
5 The entire nucleotide sequence of DNA59606-1471 is shown in Figure 201 (SEQ ID NO:284). Clone 

DNA59606-1471 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 244-246 and ending at the stop codon at nucleotide positions 1675-1677 of SEQ ID NO:284 (Figure 
201). The predicted polypeptide precursor is 477 amino acids long (Figure 202). The full-length PRO1107 
protein shown in Figure 202 has an estimated molecular weight of about 54,668 daltons and a pi of about 6.33. 
10 Clone DNA59606-1471 has been deposited with ATCC on June 9, 1998. It is understood that the deposited 
clone has the actual nucleic acid sequence and that the sequences provided herein are based on known 
sequencing techniques. 

Analysis of the amino acid sequence of the full-length PROl 107 polypeptide suggests that it possesses 
significant sequence similarity to phosphodiesterase 1/nucleotide phyrophosphatase, human insulin receptor 
15 tyrosine kinase inhibitor, alkaline phosphodiesterase and autotaxin, thereby indicating that PROl 107 may have 
at least one or all of the activities of these proteins, and that PROl 107 is a novel phosphodiesterase. More 
specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced sequence identity 
between the PROl 107 amino acid sequence and at least the following Dayhoff sequences: AF005632 1, 
P_R79148, RNU78787_1, AF0602l8j4, A57080 and HUMATXT 1, 

20 

EXAMPLE 90 : Isolation of cDNA clones Encoding Human PROl 140 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST sequence, Incyte cluster sequence No. 135917. This sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 

25 EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, C A) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, Univ. of Washington, Seattle, Washington). The consensus sequence obtained 

30 therefrom is herein designated DNA56416. 

In light of an observed sequence homology between DNA56416 and an EST sequence contained within 
Incyte EST clone no. 3345705, Incyte EST clone no. 3345705 was obtained and its insert sequenced. It was 
found that the insert encoded a full-length protein The sequence, designated herein as DNA59607-1497, which 
is shown in Figure 203, is the full-length DNA sequence for PROl 140. Clone DNA59607-1497 was deposited 

35 with the ATCC on June 9, 1998, and is assigned ATCC deposit no. 209946. 

The entire nucleotide sequence of DNA59607-1497 is shown in Figure 203 (SEQ ID NO:286). Clone 
DNA59607-1497 contains a single open reading frame with an apparent translational initiation site at nucleotide 
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positions 210-212 and ending at the stop codon at nucleotide positions 975-977 (Figure 203). The predicted 
polypeptide precursor is 255 amino acids long (Figure 204). The full-length PROl 140 protein shown in Figure 
204 has an estimated molecular weight of about 29,405 daltons and a pi of about 7.64. Analysis of the full- 
length PROl 140 sequence shown in Figure 204 (SEQ ID NO: 287) evidences the presence of three 
transmembrane domains at about amino acids 101 to 118, 141 to 161 and 172 to 191. 
5 Analysis of the amino acid sequence of the full-length PRO 1 140 polypeptide using the Dayhoff database 

(version 35.45 SwissProt 35) evidenced homology between the PRO 1 140 amino acid sequence and the following 
Dayhoff sequences: AF023602J, AF000368J, CIN3 RAT, AF003373J, GEN13279, and AF003372J, 

Clone DNA59607-1497 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit 
no. 209946. 

10 

EXAMPLE 91 : Isolation of cDNA clones Encoding Human PROl 106 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST sequence. This sequence was then compared to a variety of expressed sequence tag (EST) databases 
which included public EST databases (e.g. , GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte 

15 Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using 
the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzymology 266:460-480 (1996)). Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 
proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, 
Univ. of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated 

20 DNA56423. 

In light of an observed sequence homology between DN A56423 and an EST sequence contained within 
Incyte EST clone no. 1711247, Incyte EST clone no. 171 1247 was obtained and its insert sequenced. It was 
found that the insert encoded a full-length protein The sequence, designated herein as DNA596O9-1470, which 
is shown in Figure 205, is the full-length DNA sequence for PROl 106. Clone DNA59609-1470 was deposited 

25 with the ATCC on June 9, 1998, and is assigned ATCC deposit no. 209963. 

The entire nucleotide sequence of DNA59609-1470 is shown in Figure 205 (SEQ ID NO:288). Clone 
DNA59609-1470 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 61-63 and ending at the stop codon at nucleotide positions 1468- 1470 of SEQ ID NO:288 (Figure 205). 
The predicted polypeptide precursor is 469 amino acids long (Figure 206). The full-length PROl 106 protein 

30 shown in Figure 206 has an estimated molecular weight of about 52,689 daltons and a pi of about 8.68. It is 
understood that the skilled artisan can construct the polypeptide or nucleic acid encoding therefor to exclude any 
one or more of all of these domains. For example, the transmembrane domain region(s) and/or either of the 
amino terminal or carboxyl end can be excluded. Clone DNA59609-1470 has been deposited with ATCC on 
June 9, 1998. It is understood that the deposited clone has the actual nucleic acid sequence and that the 

35 sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PROl 106 polypeptide suggests that it possesses 
significant sequence similarity to the peroxisomal ca-dependent solute carrier, thereby indicating that PROl 106 
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may be a novel transporter. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 
35) evidenced sequence identity between the PROl 106 amino acid sequence and at least the following Dayhoff 
sequences, AF004161_1, IG002N01J25, GDC_BOVIN and BT1MAIZE. 

EXAMPLE 92 : Isolation of cPNA clones Encoding Human PRO 1291 
j§ 5 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte database, designated 120480. This EST cluster sequence was then compared 
$ to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 

? and a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 

homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
10 et ah. Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
;,i DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 

consensus sequence obtained therefrom is herein designated DNA56425, 

In light of an observed sequence homology between the DNA56425 sequence and an EST sequence 
15 encompassed within the Incyte EST clone no. 2798803, the Incyte EST clone 2798803 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
if sequence of this cDNA insert is shown in Figure 207 and is herein designated as DNA59610-I556. 

Clone DNA59610-1556 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 61-63 and ending at the stop codon at nucleotide positions 907-909 (Figure 207). The 
20 predicted polypeptide precursor is 282 amino acids long (Figure 208). The full-length PRO 1291 protein shown 
in Figure 208 has an estimated molecular weight of about 30,878 daltons and a pi of about 5.27. Analysis of 
the full-length PR01291 sequence shown in Figure 208 (SEQ ID NO:291) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 28, a transmembrane domain from about 
amino acid 258 to about amino acid 281 and potential N-glycosylation sites from about amino acid 1 12 to about 
25 amino acid 115, from about amino acid 160 to about amino acid 163, from about amino acid 190 to about amino 
acid 193, from about amino acid 196 to about amino acid 199, from about amino acid 205 to about amino acid 
208, from about amino acid 216 to about amino acid 219 and from about amino acid 220 to about amino acid 
223.. Clone DNA59610-1556 has been deposited with ATCC on June 16, 1998 and is assigned ATCC deposit 
no. 209990. 

30 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 208 (SEQ ID NO:291), evidenced significant 
homology between the PR01291 amino acid sequence and the following Dayhoff sequences: HSU90552 1, 
HSU90144J, AF033107_1, HSB73_1, HSU90142J, GGCD80_i, P W34452, MOG MOUSE, B39371 and 
P_R71360. 

35 
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EXAMPLE 93 : isolation of cDNA clones Encoding Human PRO 1105 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
5 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56430. 

10 In light of an observed sequence homology between the DNA56430 sequence and an EST sequence 

encompassed within the Incyte EST clone no. 1853047, the Incyte EST clone 1853047 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 209 and is herein designated as DNA59612-1466. 

The entire nucleotide sequence of DNA59612-1466 is shown in Figure 209 (SEQ ID NO:292). Clone 

15 DNA59612- 1466 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 28-30 and ending at the stop codon at nucleotide positions 568-570 of SEQ ID NO:292 (Figure 209). 
The predicted polypeptide precursor is 180 amino acids long (Figure 210). The full-length PROl 105 protein 
shown in Figure 210 has an estimated molecular weight of about 20,040 daltons and a pi of about 8.35. Clone 
DNA59612-1466 has been deposited with the ATCC on June 9, 1998. It is understood that the deposited clone 
20 has the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

Analyzing Figure 210, a signal peptide is at about amino acids 1-19 of SEQ ID NO:293 and 
transmembrane domains are shown at about amino acids 80-99 and 145-162 of SEQ ID NO:293. It is 
understood that the skilled artisan could form a polypeptide with all of or any combination or individual selection 
25 of these regions. It is also understood that the corresponding nucleic acids can be routinely identified and 
prepared based on the information provided herein. 

EXAMPLE 94 : Isolation of cDNA clones Encoding Human PROS 1 1 

Use of the signal science algorithm described in Example 3 above allowed identification of an EST 

30 cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

35 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program u phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56434. 
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In light of an observed sequence homology between the DNA56434 sequence and an EST sequence 
encompassed within the Incyte EST clone no. 1227491, the Incyte EST clone 1227491 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 211 and is herein designated as DNA59613-1417. 

The entire nucleotide sequence of DNA59613-I417 is shown in Figure 211 (SEQ ID NO:294). Clone 
# 5 DNA59613-1417 contains a single open reading frame with an apparent translational initiation site at nucleotide 

positions 233-235 and ending at the stop codon at nucleotide positions 944-946 (Figure 211). The predicted 
M : polypeptide precursor is 237 amino acids long (Figure 212). The full-length PR0511 protein shown in Figure 

3$ 212 has an estimated molecular weight of about 25,284 daltons and a pi of about 5.74. Clone DNA59613-1417 

has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
., } 10 the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ 10 NO: 295, the putative signal peptide is at about amino 
acids 1-25 of SEQ ID NO:295. The N-glycosylation sites are at about amino acids 45-48, 73-76, 107-110, 118- 
121, 132-135, 172-175, 175-178 and 185-188 of SEQ ID NO:295. An arthropod defensins conserved region 
is at about amino acids 176-182 of SEQ ID NO: 295. A kringle domain begins at about amino acid 128 of SEQ 
15 ID NO:295 and a ly-6/u-PAR domain begins at about amino acid 6 of SEQ ID NO:295. The corresponding 
nucleotides of these amino acid sequences and others can be routinely determined given the sequences provided 
H herein. 

The designations appearing in a Dayhoff database with which PR05 1 1 has some sequence identity are 
as follows: SSC20F10_1; SF041083; P_W26579; S44208; JC2394; PSTA DICDf; A27020; S59310; 
20 RAG 1_R ABIT; and MUSBALBC1J. 

EXAMPLE 95 : Isolation of cDNA clones Encoding Human PROl 104 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
25 expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
30 the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
;j obtained therefrom is herein designated DNA56446. 

In light of an observed sequence homology between the DNA56446 sequence and an EST sequence 
encompassed within the Incyte EST clone no. 2837496, the Incyte EST clone 2837496 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
35 sequence of this cDNA insert is shown in Figure 213 and is herein designated as DNA59616-1465. 

The entire nucleotide sequence of DNA596 16-4 465 is shown in Figure 213 (SEQ ID NO: 296). Clone 
DN A59616-1465 contains a single open reading frame with an apparent translational initiation site at nucleotide 



453 



BNSDOC1D: <WO 9963088A2. I > 



WO 99/63088 



PCT/US99/12252 



positions 109-1 11 and ending at the stop codon at nucleotide positions 11 32-1 134 of SEQ ID NO: 296 (Figure 
213). The predicted polypeptide precursor is 341 amino acids long (Figure 214), The full-length PRO 1104 
protein shown in Figure 214 has an estimated molecular weight of about 36,769 daltons and a pi of about 9.03. 
Clone DNA59616-1465 has been deposited with ATCC on June 16, 1998. It is understood that the deposited 
clone has the actual nucleic acid sequence and that the sequences provided herein are based on known 
5 sequencing techniques. 

Analyzing Figure 214, a signal peptide is at about amino acids 1-22 of SEQ ID NO: 297. N- 
myristoylation sites are at about amino acids 41-46, 110-115, 133-138, 167-172 and 179-184 of SEQ ID 
NO:297. 



10 EXAMPLE 96 : Isolation of cDNA clones Encoding Human PRO 1 100 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the lncyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (Lifeseq®, lncyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

15 homology search was performed using the computer program BLAST or BLAST2 (Altshul et aL, Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). 

In light of an observed sequence homology between the obtained consensus sequence and an EST 

20 sequence encompassed within the lncyte EST clone no. 2305379, the lncyte EST clone 2305379 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 215 and is herein designated as DNA59619-1464. 

The entire nucleotide sequence of DNA596I9-1464 is shown in Figure 215 (SEQ ID NO:298). Clone 
DNA59619-1464 contains a single open reading frame with an apparent translational initiation site at nucleotide 

25 positions 33-35 and ending at the stop codon at nucleotide positions 993-995 of SEQ ID NO:298 (Figure 215). 
The predicted polypeptide precursor is 320 amino acids long (Figure 216). The full-length PRO 1100 protein 
shown in Figure 216 has an estimated molecular weight of about 36 , 4 75 daltons and a pi of al out 7.29. Clone 
DNA59619-1464 has been deposited with ATCC on July 1, 1998. It is understood that the deposited clone has 
the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 

30 techniques. 

Upon analyzing SEQ ID NO:299, the approximate locations of the signal peptide, the transmembrane 
domains, an N-glycosylation site, an N-myristoyiation site, a CUB domain and an amiloride-sensitive sodium 
channel domain are present. It is believed that PRO 1 100 may function as a channel. The corresponding nucleic 
acids for these amino acids and others can be routinely determined given SEQ ID NO:299.. 

35 
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EXAMPLE 97 : Isolation of cDNA clones Encoding Human PRQ836 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
5 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)), Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained is herein designated DNA56453 . 

10 In light of an observed sequence homology between the DNA56453 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 2610075, the Incyte EST clone 2610075 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 217 and is herein designated as DNA59620-1463. 

The entire nucleotide sequence of DNA59620-1463 is shown in Figure 217 (SEQ ID NO:300), Clone 

15 DNA59620-1463 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 65-67 and ending at the stop codon at nucleotide positions 1448-1450 of SEQ ID NO: 300 (Figure 217). 
The predicted polypeptide precursor is 461 amino acids long (Figure 218). The full-length PR0836 protein 
shown in Figure 218 has an estimated molecular weight of about 52,085 daltons and a pi of about 5.36. Analysis 
of the full-length PR0836 sequence shown in Figure 218 (SEQ ID NO: 301) evidences the presence of the 

20 following: a signal peptide, N-glycosylation sites, N-myristoylation sites, a domain conserved in the 
YJL126w/YLR351c/yhcX family of proteins, and a region having sequence identity with SLS1. Clone 
DNA59620-1463 has been deposited with ATCC on June 16, 1998. It is understood that the deposited clone 
has the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

25 Analysis of the amino acid sequence of the full-length PRO 83 6 polypeptide suggests that it possesses 

some sequence similarity to SLS 1 , thereby indicating that PR0836 may be involved in protein translocation of 
the ER. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced some 
homology between the PRO 836 amino acid sequence and at least the following Dayhoff sequences, S58132, 
SPBC3B9J, S66714, CRU40057J and IMACAEEL. 

30 

EXAMPLE 98 : Isolation of cDNA clones Encoding Human PROl 141 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 11873. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
35 and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
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in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56518. 

In light of an observed sequence homology between the DNA56518 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2679995, the Incyte EST clone 2679995 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 219 and is herein designated as DNA59625-1498. 

Clone DNA59625-1498 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 204-206 and ending at the stop codon at nucleotide positions 945-947 (Figure 219). 
The predicted polypeptide precursor is 247 amino acids long (Figure 220). The full-length PROl 141 protein 
shown in Figure 220 has an estimated molecular weight of about 26,840 daitons and a pi of about 8.19. Analysis 
of the full-length PROl 141 sequence shown in Figure 220 (SEQ ID NO:303) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 19 and transmembrane domains from 
about amino acid 38 to about amino acid 57, from about amino acid 67 to about amino acid 83, from about 
amino acid 117 to about amino acid 139 and from about amino acid 153 to about amino acid 170. Clone 
DNA59625-1498 has been deposited with ATCC on June 16, 1998 and is assigned ATCC deposit no. 209992. 

An analysis of the Dayhoff database (version 35,45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 220 (SEQ ID NO:303), evidenced significant 
homology between the PROl 141 amino acid sequence and the following Dayhoff sequences: CEVF36H2L2, 
PCRB7PRJ1, AB000506 1, LEU95008J, MRU87980J5, YIGMECOLI, STU65700 1, GHU62778_1, 
CYST_SYNY3 and AF009567J. 

EXAMPLE 99 : Isolation of cDNA clones Encoding Human PROl 132 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is designated herein as DNA35934. Based on the DNA35934 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library iiat contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PROl 132. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer : 5'-TCCTGTGACCACCCCTCTAACACC-3' (SEQ ID NO:310) and 
reverse PCR primer : 5 ' -CTGG A AC ATCTGCTGCCC AG ATTC-3 ' (SEQ ID NO:311). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
sequence which had the following nucleotide sequence: 

5 -GTCGGATGACAGCAGCAGCCGCATCATCAATGGATCCGACTGCGATATGC-3' (SEQ ID NO:312). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above, A positive library was then used to 
isolate clones encoding the PROl 132 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney. 



456 



WO 99/63088 



PCT/US99/12252 



DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PROl 132 and the derived protein sequence for PRO 1 132. 

The entire nucleotide sequence of PROl 132 is shown in Figure 225 (SEQ ID NO:308). Clone 
DNA59767-1489 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 354-356 and a stop codon at nucleotide positions 1233-1235 (Figure 225; SEQ ID NO:308). The 
^ 5 predicted polypeptide precursor is 293 amino acids long. The signal peptide is at about amino acids 1-22 and 

the histidine active site is at about amino acids 104-109 of SEQ ID NO:309. Clone DNA59767-1489 has been 
^ deposited with ATCC (having the actual sequence rather than representations based on sequencing techniques 

^ as presented herein) and is assigned ATCC deposit no. 203108. The full-length PROl 132 protein shown in 

Figure 226 has an estimated molecular weight of about 32,020 daltons and a pi of about 8.7. 
10 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 226 (SEQ ID NO:? >9), revealed sequence identity 
between the PROl 132 amino acid sequence and the following Dayhoff sequences: SSU76256_1, P_W10694, 
MMAE000663 6, AF013988J, U66061 8, MMAE000665_2, VLMAE00066415, MMAE00066414, 
MMAE00G665_4 and MMAE00066412. 

15 

EXAMPLE 100 : Isolation of cDNA clones Encoding Human NL7 (PRO 1346) 
^ A single EST sequence (#1398422) was found in the LIFESEQ* database as described in Example 1 

above. This EST sequence was renamed as DNA45668. Based on the DNA45668 sequence, oligonucleotides 
were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use 
20 as probes to isolate a clone of the full-length coding sequence for NL7. 
PCR primers (forward and reverse) were synthesized: 
forward PCR primer : 5 ' -C AC ACGTCC A ACCTC A ATGGGC AG- 3 ' (SEQ ID NO:315) 
reverse PCR primer : 5'-GACCAGCAGGGCCAAGGACAAGG 3' (SEQ ID NO:316) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
25 DNA45668 sequence which had the following nucleotide sequence: 
hybridization probe : 

5 ' -GTTCTCTG AG ATGAAG ATCCGGCCGGTCCGGG AGTACCGCTT AG-3 ' 
(SEQ ID NO:317) 

^ In order to screen _ .cral libraries for a source of a full-length clone, DNA from the libraries was 

30 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the NL7 gene using the probe oligonucleotide and one of the PCR primers. RNA for 
construction of the cDNA libraries was isolated from a human fetal kidney library (LIB227). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for NL7 
(designated herein as DN A59776- 1600 [Figure 227 T SEQ ID NO: 3 1 3]) and the derived protein sequence for NL7 
35 (PRO 1346). 

The entire coding sequence of NL7 (PR01346) is shown in Figure 227 (SEQ ID NO:313). Clone 
DNA59776-1600 contains a single open reading frame with an apparent translational initiation site at nucleotide 



457 



BNSDOCID: <WO 9963088A2 .. 1 ... > 



WO 99/63088 



PCT/US99/12252 



positions 1 -3 and an apparent stop codon at nucleotide positions 1 384-1386. The predicted polypeptide precursor 
is 461 amino acids long. The protein contains an apparent type II transmembrane domain at amino acid positions 
from about 31 to about 50; fibrinogen beta and gamma chains C-terminal domain signature starting at about 
amino acid position 409, and a leucine zipper pattern starting at about amino acid positions 140, 147, 154 and 
161, respectively. Clone DNA59776-1600 has been deposited with ATCC and is assigned ATCC deposit no. 
5 203128. The full-length NL7 protein shown in Figure 228 has an estimated molecular weight of about 50,744 
daltons and a pi of about 6.38. 

Based on a WU-BLAST2 sequence alignment analysis (using the WU-BLAST2 computer program) of 
the full-length sequence, NL7 shows significant amino acid sequence identity to a human microfibril-associated 
glycoprotein (1 MFA4_HUMAN); to known TIE-2 Iigands and ligand homologues, ficolin, serum lectin and 
10 TGF-1 binding protein. 



EXAMPLE 101 : Isolation of cDNA clones Encoding Human PROl 131 

A cDNA sequence isolated in the amylase screen described in Example 2 above is herein designated 
DNA43546 (see Figure 231; SEQ ID NO:320). The DNA43546 sequence was then compared to a variety of 

1 5 expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et ah, Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into consensus DNA sequences with 

20 the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA45627. 

Based on the DNA45627 sequence, oligonucleotide probes were generated and used to screen a human 
library prepared as described in paragraph 1 of Example 2 above. The cloning vector was pRK5B (pRK5B is 
a precursor of pRK5D that does not contain the Sfil site; see, Holmes ei al., Science 253: 1278-*280 (1991)), 

25 and the cDNA size cut was less than 2800 bp. 

PCR primers (forward and 2 reverse) were synthesized: 
forward PCR primer 5 ' - ATGC AGGCC A AGTAC AGC AGC AC-3 ' (SEQ ID NO:321); 
reverse PCR primer 1 5 ' -C ATGCTG ACG ACTTCCTGC A AGC - 3 ' (SEQ ID NO:322); and 
reverse PCR primer 1 5 ' -CCAC AC AGTCTCTGCTTCTTGGG-3 ' (SEQ ID NO: 323) 

30 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA45627 

sequence which had the following nucleotide sequence: 
hybridization probe 

5'-ATGCTGGATGATGATGGGGACACCACCATGAGCCTGCATT-3' (SEQ ID NO:324). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
35 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROl 131 gene using the probe oligonucleotide and one of the PCR primers. 
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A full length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 144-146, and a stop signal at nucleotide positions 984-986 
(Figure 229; SEQ ID NO:318). The predicted polypeptide precursor is 280 amino acids long, has a calculated 
molecular weight of approximately 31,966 daltons and an estimated pi of approximately 6.26. The 
transmembrane domain sequence is at about 49-74 of SEQ ID NO:319 and the region having sequence identity 

^ 5 with LDL receptors is about 50-265 of SEQ ID NO:3I9. PROl 131 contains potential N-linked grycosyiation 

sites at amino acid positions 95-98 and 169-172 of SEQ ID NO:3 19. Clone DNA59777- 1480 has been deposited 

;| with the ATCC and is assigned ATCC deposit no. 2031 1 1 . 

%. An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in.Figure 230 (SEQ ID NO:319), evidenced some sequence 
10 identity between the PROl 131 amino acid sequence and the following Dayhoff sequences: A B0 107 KM, 149053, 
I49H5, RNU56863J, LY4A_MOUSE, 155686, MMU56404J, 149361, AF030313 1 and MMU09739J . 



EXAMPLE 102 : Isolation of cDNA clones Encoding Human PRO 1281 

A consensus DNA sequence was a^emoled relative to other EST sequences using phrap as described 
15 in Example 1 above. This consensus sequence is designated herein as DNA35720. Based on the DNA35720 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 
H of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO 128 1 . 

PCR primers (forward and reverse) were synthesized: 
forward PCR primers : 
20 5'-TGGAAGGCTGCCGCAACGACAATC-3' (SEQ ID NO:327); 
5 1 -CTG ATGTGGCCGATGTTCTG-3 ' (SEQ ID NO:328); and 
5 * -ATGGCTCAGTGTGC AGAC AG-3 ' (SEQ ID NO:329). 
reverse PCR primers : 

5*-GCATGCTGCTCCGl GAAGTAGTCC-3' (SEQ ID NO:330); and 
25 5 * -ATGCATGGG AAAG A AGGCCTGCCC-3 ' (SEQ ID NO:331). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA35720 sequence 
which had the following nucleotide sequence: 
hybridization probe: 

^ 5 ' -TGC ACTGGTGACC A CGAGGGGGTGC ACT AT AGCC ATCTGG AGCTGAG-3 ' (SEQ ID NO:332). 

30 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pairs identified above. A positive library was then used 
to isolate clones encoding the PR01281 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated human fetal liver. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
35 PR01281 (designated herein as DNA59820-1549 [Figure 232, SEQ ID NO: 325]; and the derived protein 
sequence for PRO 1281. 
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The entire coding sequence of PR01281 is shown in Figure 232 (SEQ ID NO:325). Clone DNA59820- 
1549 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 
228-230 and an apparent stop codon at nucleotide positions 2553-2555. The predicted polypeptide precursor 
is 775 amino acids long. The full-length PRO 1281 protein shown in Figure 233 has an estimated molecular 
weight of about 85,481 daltons and a pi of about 6.92. Additional features include a signal peptide at about 
5 amino acids 1-15; and potential N-glycosylation sites at about amino acids 138-141 and 361-364. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 233 (SEQ ID NO: 326), revealed some sequence 
identity between the PRO 128 1 amino acid sequence and the following Dayhoff sequences: S44860, CET24D11 , 
CEC38H2_3, CAC2_HAECO, B3A2HUMAN, S22373, CEF38A3 2, CEC34F6J2, CEC34F6_3, and 
10 CELT22B11J. 

Clone DNA59820-1549 has been deposited with ATCC and is assigned ATCC deposit no. 203129. 

EXAMPLE 103 : Isolation of cDNA clones Encoding Human PRO 1064 

A cDNA sequence isolated in the amylase screen described in Example 2 above was found, by the WU- 

15 BLAST2 sequence alignment computer program, to have no significant sequence identity to any known human 
protein. This cDNA sequence is herein designated DNA45288. The DNA45288 sequence was then compared 
to various EST databases including public EST databases (e.g., GenBank), and a proprietary EST database 
(LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify homologous EST sequences. The comparison 
was performed using the computer program BLAST or BLAST2 [Altschul et al., Methods in Enzvmology, 

20 266:460-480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in some cases, 90) or greater 
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program "phrap" (Phil Green, University of Washington, Seattle, Washington), This consensus sequence is 
herein designated DNA48609, Oligonucleotide primers based upon the DNA48609 sequence were then 
synthesized and employed to screen a human fetal kidney cDNA library which resulted in the identification of 

25 the DNA59827-1426 clone shown in Figure 234. The cloning vector was pRK5B (pRK5B is a precursor of 
pRK5D that does not contain the Sfil site; see, Holmes et al., Science . 253:1278-1280 (1991)), and the cDNA 
size cut was less than 2800 bp. 

The oligonucleotide probes employed were as follows: 
forward PCR primer 5 , -CTGAGACCCTGCAGCA^CATCTG-3' (SEQ ID NO:336) 

30 reverse PCR primer 5'-GGTGCTTCTTGAGCCCCACTTAGC-3' (SEQ ID NO:337) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA48609 sequence which had the following nucleotide sequence 
hybridization probe 

5 ' - A ATCT AGCTTCTCC AGG ACTGTGGTCGCCCCGTCCGCTGT-3 ' (SEQ ID NO:338) 
35 A full length clone was identified that contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 532-534 and a stop signal at nucleotide positions 991-993 
(Figure 234, SEQ ID NO: 333). The predicted polypeptide precursor is 153 amino acids long, has a calculated 
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molecular weight of approximately 17,317 daltons and an estimated pi of approximately 5.17. Analysis of the 
full-length PRO 1064 sequence shown in Figure 235 (SEQ ID NO: 334) evidences the presence of the following; 
a signal peptide from about amino acid 1 to about amino acid 24, a transmembrane domain from about amino 
acid 89 to about amino acid 1 10, an indoIe-3-glycerol phosphate synthase homology block from about amino acid 
74 to about amino acid 105 and a Myb DNA binding domain protein repeat protein homology block from about 
5 amino acid 1 14 to about amino acid 137. Clone DNA59827-1426 has been deposited with ATCC on August 4, 
1998 and is assigned ATCC deposit no. 203089. 
*| An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

M alignment analysis of the full-length sequence shown in Figure 235 (SE^> ID NO: 334), evidenced homology 

between the PRO1064 amino acid sequence and the following Dayhoff sequences: MMNP15PRO_l, 
10 BP187PLYH_1 I CELF42G8_4,MMU58888_1,GEN14270 1 TUB8_SOLTU,RCN_MOUSE,HUMRBSY79_I, 
'"" SESENODA 1 and A21467J. 

EXAMPLE 104 : Isolation of cDNA clones Encoding Huuian PRO 1379 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
15 in Example 1 above. This consensus sequence is designated herein DNA45232. Based on the DNA45232 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
$| the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 

^ PR01379. 

PCR primers (forward and reverse) were synthesized: 
20 forward PCR primer 5 ' -TGG AC ACCGT ACCCTGGT ATCTGC - 3 ' (SEQ ID N0.341) 
reverse PCR primer 5'-CCAACTCTGAGGAGAGCAAGTGGC-3' (SEQ ID N0.342) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA45232 sequence which had the following nucleotide sequence: 
hybridization probe 

25 5 '-TGT ATGTGC AC ACCCTCACC ATCACCTCC AAGGGC AAGG AG A AC-3 ' (SEQ ID N0:343). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO 1379 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated human fetal kidney tissue. 

30 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PR01379 which is designated herein as DNA59828-1608 and shown in Figure 237 (SEQ ID NO:339); and the 
derived protein sequence for PRO 1379 (SEQ ID NO: 340). 

The entire coding sequence of PR01379 is shown in Figure 237 (SEQ ID NO:339). Clone DNA59828- 
1608 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 

35 10-12 and an apparent stop codon at nucleotide positions 1732-1734. The predicted polypeptide precursor is 574 
amino acids long. The full-length PRO 1379 protein shown in Figure 238 has an estimated molecular weight of 
about 65,355 daltons and a pi of about 8.73, Additional features include a signal peptide at about amino acids 
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1-17 and potential N-glycosylation sites at about amino acids 160-163, 287-290, and 323-326. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 238 (SEQ ID NO:340), revealed some homology 
between the PRO 1 379 amino acid sequenceand the following Dayhoff sequences : YH Y8_ YEAST, AF040625_ 1 , 
HP714394J, and HIV18U45630 1. 
5 Clone DNA59828-1608 has been deposited with ATCC and is assigned ATCC deposit no. 203158. 



■■u EXAMPLE 105 : Isolation of cPNA Clones Encodine Human PRQ844 

% An expressed sequence tag (EST) DNA database (LIFESEQ™, incyte Pharmaceuticals, Palo Alto, CA) 

was searched and an EST was identified which showed sequence identity with aLP. Based on the information 
10 and discoveries provided herein, the clone for this EST, Incyte clone no. 2657496 from a cancerous lung library 
was further examined. 

DNA sequencing of the insert for this clone gave a sequence (herein designated as DNA59838-1462; 
SEQ ID NO: 344) which includes the full-length DNA sequence for PRO 844 and the derived protein sequence 
for PR0844. 

15 The entire nucleotide sequence of DNA59838-1462 is shown in Figure 239 (SEQ ID NO:344). Clone 

DNA59838-1462 contains a single open reading frame with an apparent trans 1 at ional initiation site at nucleotide 
g| positions 5-7 and ending at the stop codon at nucleotide positions 338-340 of SEQ ID NO:344 (Figure 239). 

^: The predicted polypeptide precursor is 111 amino acids long (Figure 240). The full-length PR0844 protein 

shown in Figure 240 has an estimated molecular weight of about 12,050 daltons and a pi of about 5.45. Clone 
20 UNQ544 DNA59838-1462 has been deposited with ATCC on June 16, 1998. It is understood that the deposited 
clone has the actual nucleic acid sequence and that the sequences provided herein are based on known 
sequencing techniques. 

Analysis of the amino acid sequence of the full-length PR0844 polypeptide suggests that it possesses 
significant sequence similarity to serine protease inhibitors, thereby indicating that PR0844 may be a novel 
25 proteinase inhibitor. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced significant homology between the PR0844 amino acid sequence and at least the following Dayhoff 
sequences, ALKl JiUMAN, P P82403, PP82402, ELAFJRUMAN and P_P60950. 

4$ EXAMPLE 106 : Isolation of cPNA Clones Encoding Human PRQ848 

30 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al. , Methods in 
35 Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
m the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
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obtained therefrom is herein designated DNA55999. 

In light of an observed sequence homology between the DNA55999 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2768571 , the Incyte EST clone 2768571 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 241 and is herein designated as DNA59839-1461. 
5 The entire nucleotide sequence of DNA59839-1461 is shown in Figure 241 (SEQ ID NO:346). Clone 

DNA59839-1461 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 146-148 and ending at the stop codon at nucleotide positions 1946-1948 of SEQ ID NO:346 (Figure 
241), The predicted polypeptide precursor is 600 amino acids long (Figure 242). The full-length PR0848 
protein shown in Figure 242 has an estimated molecular weight of about 68,536 daltons. Clone DNA59839- 1461 

10 has been deposited with ATCC on June 16, 1998. It is understood that the deposited clone has the actual nucleic 
acid sequence and that the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PROS48 polypeptide suggests that it may be a 
novel sialyltransferase. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced sequence identity between the PR0848 amino acid sequence and at least the following Dayhoff 

15 sequences, PJR78619 (GalNAc-alpha-2, 6-sialyltransferase), CAAG5CHICK (alpha-n-acetyigalactosamide 
alpha-2, 6-sialytransferase),HSU14550_l , CAG6 HUMAN and P R632 1 7 (human alpha-2, 3-sialyltransferase). 

EXAMPLE 107 : Isolation of cDNA Clones Encoding Human PRO 1097 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

20 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ® Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzymojogy 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

25 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56006. 

In light of an observed sequence homology between the DNA56006 consensus sequence and an EST 
sequence encompassed within the Incyte EST done no. 2408105, the Incyte EST clone 2408105 purchased 

30 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 243 and is herein designated as DNA59841-1460. 

The entire nucleotide sequence of DNA59841-1460 is shown in Figure 243 (SEQ ID NO:348). Clone 
DNA5984 1-1460 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 3-5 and ending at the stop codon at nucleotide positions 276-278 of SEQ ID NO:348 (Figure 243). 

35 The predicted polypeptide precursor is 91 amino acids long (Figure 244). The full-length PRO1097 protein 
shown in Figure 244 has an estimated molecular weight of about 10,542 daltons and a pi of about 10.04. Clone 
DNA5984 1 - 1 460 has been deposited with ATCC on July 1 , 1 998 . It is understood that the deposited clone has 
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the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

Analyzing Figure 244, the signal peptide is at about amino acids 1-20 of SEQ ID NO:349. The 
glycoprotease family protein domain starts at about amino acid 56, and the acy transferase ChoActase/COT/CPT 
family peptide starts at about amino acid 49 of SEQ ID NO: 349. 

5 

EXAMPLE 108 : Isolauon of cDNA clones Encoding Human PROl 153 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. » GenBank) and a proprietary 

10 EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

15 obtained therefrom is herein designated DNA56008. 

In light of an observed sequence homology between the DNA56008 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2472409, the Incyte EST clone 2472409 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 245 and is herein designated as DNA59842-1502. 

20 The full length clone shown in Figure 245 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 92-94 and ending at the stop codon found at nucleotide positions 
683-685 (Figure 245; SEQ ID NO:350). The predicted polypeptide precursor (Figure 246, SEQ ID NO:351) 
is 197 amino acids long. PROl 153 has a calculated molecular weight of approximately 21 ,540 daltons and an 
estimated pi of approxirr nely 8.31. Clone DNA59842-1502 has been deposited with ATCC and is assigned 

25 ATCC deposit no. 209982. It is understood that the correct and actual sequence is in the deposited clone while 
herein are present representations based on current sequencing techniques which may have minor errors. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
length sequence, PR01I53 shows some amino acid sequence identity to the following Dayhoff designations: 
S57447; SOYHRGPC J; S46965; P_P82971; VCPHEROPHJ; EXTN_TOB AC ; MLCB2548J); 

30 ANXARABIT; JC5437 and SSGP_VOLCA. 

EXAMPLE 109 : Isolation of cDNA clones Encoding Human PROl 154 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
35 expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
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Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56025. 

In light of an observed sequence homology between the DNA56025 consensus sequence and an EST 
5 sequence encompassed within the Incyte EST clone no. 2169375, the Incyte EST clone 2169375 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 247 and is herein designated as DNA59846-1503. 

The full length clone shown in Figure 247 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 86-88 and ending at the stop codon found at nucleotide positions 
10 2909-2911 (Figure 247; SEQ ID NO:352). The predicted polypeptide precursor (Figure 248, SEQ ID NO:353) 
is 941 amino acids long. PRO 1 154 has a calculated molecular weight of approximately 107, 144 daltons and an 
estimated pi of approximately 6.26. Clone DNA59846-1503 has been deposited with ATCC and is assigned 
ATCC deposit no. 209978. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
15 length sequence, PROl 154 shows sequence identity to at least the following Dayhoff designations: AB01 1097_1 , 
AMPN HUMAN, RNU76997J, 159331, GEN 14047, HSU62768J , P R5 1281, CET07F10J, SSU66371_1, 
and AMPREJJUMAN. 

EXAMPLE 110 : Isolation of cDNA clones Encoding Human PROl 181 

20 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

EST cluster sequence from the Incyte database, designated herein as 82468. This EST cluster sequence was then 
compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., 
GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, C A) to identify 
existing homologies. The homology search was performed using the computer program BLAST or BLAST2 

25 (Altshul et aL, Methods in Enzvmology 266:460-480 (1996)), Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA56029. 

In light of an observed sequence homology between the DNA56029 consensus sequence and an EST 

30 sequence encompassed within the Incyte EST clone no. 2186536, the Incyte EST clone 2186536 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 249 and is herein designated as DNA59847-15I1. 

Clone DNA59847-1511 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 17-19 and ending at the stop codon at nucleotide positions 1328-1330 (Figure 249). 

35 The predicted polypeptide precursor is 437 amino acids long (Figure 250). The full-length PROl 181 protein 
shown in Figure 250 has an estimated molecular weight of about 46,363 daltons and a pi of about 6.22. Analysis 
of the full-length PRO 1181 sequence shown in Figure 250 (SEQ ID NO:355) evidences the presence of the 
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following: a signal peptide from about amino acid 1 to about amino acid 15, potential N-glycosylation sites from 
about amino acid 46 to about amino acid 49, from about amino acid 189 to about amino acid 192 and from about 
amino acid 382 to about amino acid 385 and amino acid sequence blocks having homology to Ly-6/u-PAR 
domain proteins from about amino acid 287 to about amino acid 300 and from about amino acid 98 to about 
amino acid 111. Clone DNA59847-1511 has been deposited with ATCC on August 4, 1998 and is assigned 
5 ATCC deposit no, 203098. 

An analysis of the Dayhoff database (version 35.45 Swiss Prot 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 250 (SEQ ID NO: 355), evidenced homology 
between the PROH81 amino acid sequence and the following Dayhoff sequences: AF041083_1, PW26579, 
RNMAGPIAN1, CELT13C2_2, LMSAP2GNJ, S61882, CEF35C5 12, DP87 DICDI, GIU4763IJ and 
10 P_R07092, 

EXAMPLE HI : Isolation of cDNA clones Encoding Human PRO 11 82 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database, designated herein as 146647. This EST cluster sequence was 

15 then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al. t Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 

20 assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56033, 

In light of an observed sequence homology between the DNA56033 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2595195, the Incyte EST clone 2595195 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

25 The sequence of this cDNA insert is shown in Figure 251 and is herein designated as DNA59848-1512. 

Clone DNA59848-1512 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 67-69 and ending at the stop codon at nucleotide positions 880-882 (Figure 251). The 
predicted polypeptide precursor is 271 amino acids long (Figure 252). The full-length PROl 182 protein shown 
in Figure 252 has an estimated molecular weight of about 28,665 daltons and a pi of about 5.33. Analysis of 

30 the full-length PROl 182 sequence shown in Figure 252 (SEQ ID NO:357) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 25, an amino acid block having 
homology to C-type lectin domain proteins from about amino acid 247 to about amino acid 256 and an amino 
acid sequence block having homology to Clq domain proteins from about amino acid 44 to about amino acid 
77. Clone DNA59848-1512 has been deposited with ATCC on August 4, 1998 and is assigned ATCC deposit 

35 no. 203088. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 252 (SEQ ID NO:357) f evidenced significant 
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homology between the PRO 11 82 amino acid sequence and the following Dayhoff sequences: PSPD BOVIN, 
CL43BOVIN, CONGJBOVIN, P WI8780, PJ*45005, PJ*53257 and CELEGAP7J. 

EXAMPLE 112 : Isolation of cDNA clones Encoding Human PROI155 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
^ 5 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 

expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
§ EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

If homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 

Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
10 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
„ obtained therefrom is herein designated DNA56102. 

In light of an observed sequence homology between the DNA56102 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2858870, the Incyte EST clone 2858870 was purchased 
15 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 253 and is herein designated as DNA59849-1504. 
|§ The full length clone shown in Figure 253 contained a single open reading frame with an apparent 

^ translational initiation site at nucleotide positions 158-160 and ending at the stop codon found at nucleotide 

positions 563-565 (Figure 253; SEQ ID NO:358). The predicted polypeptide precursor (Figure 254, SEQ ID 
20 NO:359) is 135 amino acids long. PROl 155 has a calculated molecular weight of approximately 14,833 daltons 
and an estimated pi of approximately 9.78. Clone DNA59849-I504 has been deposited with ATCC and is 
assigned ATCC deposit no. 209986. It is understood that the actual clone has the correct sequence whereas 
herein are only representations which are prone to minor sequencing errors. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
25 length sequence, PROH55 shows some amino acid sequence identity with the following Dayhoff designations: 
TKNKBOVIN; PVBI9X587J; AF019049J; P_W00948; S72864; P W00949; 162742; AF038501_1; 
TKNG HUM AN ; and YAT1 RHOBL. Based on the information provided herein, PROl 155 may play a role 
in providing neuroprotection and cognitive enhancement. 

30 EXAMPLE 113 : Isolation of cDNA clones Encoding Human PROl 156 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database, designated herein as 138851. This EST cluster sequence was 
then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to 
35 identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et aL, Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a 
*f s BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
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assembled into a consensus DNA sequence with the program w phrap w (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA5626I. 

In light of an observed sequence homology between the DNA56261 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3675191, the Incyte EST clone 3675191 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
5 The sequence of this cDNA insert is shown in Figure 255 and is herein designated as DNA59853-1505. 

The full length clone shown in Figure 255 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 212-214 and ending at the stop codon found at nucleotide 
positions 689-691 (Figure 255; SEQ ID NO:360). The predicted polypeptide precursor (Figure 256, SEQ ID 
NO:361) is 159 amino acids long. PROl 156 has a calculated molecular weight of approximately 17,476 daltons, 
10 an estimated pi of approximately 9.15, a signal peptide sequence at about amino acids 1 to about 22, and 
potential N-g v cosylation sites at about amino acids 27-30 and 41-44. 

Clone DNA59853-1505 was deposited with the ATCC on June 16, 1998 and is assigned ATCC deposit 
no. 209985. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
15 alignment analysis (using the ALIGN computer program) of the full-length sequence shown in Figure 256 (SEQ 
ID NO:361), revealed some homology between the PROl 156 amino acid sequence and the following Dayhoff 
sequences: D45027J, PJR79914, JC5309, KBF2 HUMAN, AF010144J, GEN1435L S68681, PR79915, 
ZMTAC 3, and HUMCPGOJ. 

20 EXAMPLE 114 : Isolation of cDNA Clones Encoding Human PRO 1098 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

25 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56377. 

30 In light of an observed sequence homology between the DNA56377 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3050917, the Incyte EST clone 3050917 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 257 and is herein designated as DNA59854-1459. 

The entire nucleotide sequence of DNA59854-1459 is shown in Figure 257 (SEQ ID NO:362). Clone 

35 DNA59854-1459 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 58-60 and ending at the stop codon at nucleotide positions 292-294 of SEQ ID NO:362 (Figure 257). 
The predicted polypeptide precursor is 78 amino acids long (Figure 258). The full-length PRO 1098 protein 
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shown in Figure 258 has an estimated molecular weight of about 8,396 daltons and a pi of about 7.66. Clone 
DNA59854-1459 has been deposited with ATCC on June 16, 1998. It is understood that the deposited clone 
has the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

Analyzing Figure 258, a signal peptide appears to be at about amino acids 1-19 of SEQ ID NO:363, 
>i 5 an N-glycosylation site appears to be at about amino acids 37-40 of SEQ ID NO:363, and N-myristoylation sites 

appear to be at about 15-20, 19-24 and 60-65 of SEQ ID NO:363. 

H EXAMPLE 115 ; Isolation of cDNA clones Encoding Human PROl 127 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
10 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmoloav 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
15 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
jg; obtained therefrom is herein designated DNA57959. 

* In light of an observed sequence homology between the DNA 5 7959 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 685126, the Merck EST clone 685126 was purchased 

20 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 259 and is herein designated as DNA60283-1484. 

The full length clone shown in Figure 259 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 126-128 and ending at the stop codon found at nucleotide 
positions 327-329 (Figur- 259; SEQ ID NO:364). The predicted polypeptide precursor (Figure 260, SEQ ID 

25 NO:365) is 67 amino acids long including a signal peptide at about 1-29 of SEQ ID NO:365. PROl 127 has a 
calculated molecular weight of approximately 7,528 daltons and an estimated pi of approximately 4.95. Clone 
DNA60283-1484 was deposited with the ATCC on July 1, 1998 and is assigned ATCC deposit no. 203043. 
It is understood that the deposited clone has the actual sequence, whereas representations which may have minor 
45 sequencing errors are presented herein. 

30 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 260 (SEQ ID NO:365), revealed some homology 
between the PROl 127 amino acid sequence and the following Dayhoff sequences: AF037218__48, P W09638, 
HBA HETPO, S39821, KR2JEBV, CET20D3 8, HCU37630J, HS193B12J0, S40012 and TRITUBCJ . 

35 
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EXAMPLE 116: Isolation of cDNA clones Encoding Human PROl 126 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
5 homology search was performed using the computer program BLAST or BLAST2 (Altshul et aL, Methods in 
Enzvmoloev 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56250. 

10 In light of an observed sequence homology between the DNA56250 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 1437250, the Incyte EST clone 1437250 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 261 and is herein designated as DNA6061 5-1483. 

Clone DNA60615-1483 contains a ^..gle open reading frame with an apparent translational initiation 

15 site at nucleotide positions 110-112 and ending at the stop codon at nucleotide positions 1316-1318 (Figure 261 ) . 
The predicted polypeptide precursor is 402 amino acids long (Figure 262). The full-length PROl 126 protein 
shown in Figure 262 has an estimated molecular weight of about 45,92 1 daltons and a pi of about 8.60, Analysis 
of the full-length PRO 1126 sequence shown in Figure 262 (SEQ ID NO: 367) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 25 and potential N-glycosylation sites 

20 from about amino acid 66 to about amino acid 69, from about amino acid 1 38 to about amino acid 141 and from 
about amino acid 183 to about amino acid 186. Clone DNA60615-1483 has been deposited with ATCC on June 
16, 1998 and is assigned ATCC deposit no. 209980. 

An analysis of the Dayhoff database (version 35.45 Swiss Prot 35), using a WU-BLAST2 sequence 
alignment analysis of the ^ull -length sequence shown in Figure 262 (SEQ ID NO: 367), evidenced significant 

25 homology between the PROl 126 amino acid sequence and the following Dayhoff sequences: 173636, 
NOMRJIUMAN, MMUSMYOC31, HS454G6_1, P_R98225, RNU78105J, RNU72487J, AF035301J, 
CEELC48E7_4 and CEFHC3J3. 

EXAMPLE 117 : Isolation of cDNA clones Encoding Human PROl 125 

30 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et ah, Methods in 

35 Enzvrnology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
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obtained therefrom is herein designated DNA56540. 

In light of an observed sequence homology between the DNA56540 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 14861 14, the Incyte EST clone 14861 14 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 263 and is herein designated as DNA60615-1483. 
5 The full length clone shown in Figure 263 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 47-49 and ending at the stop codon found at nucleotide positions 
1388-1390 (Figure 263; SEQ ID NO:368). The predicted polypeptide precursor (Figure 264, SEQ ID NO:369) 
is 447 amino acids long. PRO 1 125 has a calculated molecular weight of approximately 49,798 daltons and an 
estimated pi of approximately 9.78. Clone DNA60619-1482 has been deposited with ATCC and is assigned 

10 ATCC deposit no. 209993. It is understood that the clone has the actual sequence and that the sequences herein 
are representations based on current techniques which may be prone to minor errors. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
length sequence, PRO 1125 shows some sequence identity with the following Dayhoff designations: 
RCO!_NEUCR; S58306; PKWATHECU; S76086; PR85881; HET1PODAN; SPU92792J; 

15 APAFH UM AN ; S76414 and S59317. 

EXAMPLE 118 : Isolation of cDNA clones Encoding Human PRO! 186 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 

20 expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et aL, Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 

25 the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56748. 

In light of an observed sequence homology between the DNA56748 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3476792, the Incyte EST clone 3476792 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

30 The sequence of this cDNA insert is shown in Figure 265 and is herein designated as DNA60621-1516. 

The full length clone shown in Figure 265 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 91 -93 and ending at the stop codon found at nucleotide positions 
406-408 (Figure 265; SEQ ID NO:370). The predicted polypeptide precursor (Figure 266, SEQ ID NO:371) 
is 105 amino acids long. The signal peptide is at amino acids 1-19 of SEQ ID NO:371. PROH86 has a 

35 calculated molecular weight of approximately 1 1,715 daltons and an estimated pi of approximately 9.05. Clone 
DNA6062M516 was deposited with the ATCC on August 4, 1998 and is assigned ATCC deposit no, 203091 . 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 266 (SEQ ID NO:371), revealed some sequence 
identity between the PRO 11 86 amino acid sequence and the following Dayhoff sequences: VPRADENPO, 
LFE4CHICK, AF034208J, AF030433 1, A55035, COLRABIT, CELB0507 9, S67826J, S34665 and 
CRU73817J. 

5 

EXAMPLE 119 : Isolation of cDNA clones Encoding Human PRO 1198 

An initial DNA sequence referred to herein as DNA52083 was identified using a yeast screen in a 
human umbilical vein endothelial cell cDNA library that preferentially ^presents the 5' ends of the primary 
cDNA clones. DNA52083 was compared to ESTs from public database (e.g., GenBank), and a proprietary 
10 EST database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA), using the computer program BLAST or 
BLAST2 [Altschul et al., Methods in Enzvmology. _66;460-480 (1996)]. The ESTs were clustered and 
assembled into a consensus DNA sequence using the computer program "phrap" (Phil Green, University of 
Washington, Seattle, Washington). One or more of the ESTs was obtained from human breast skin tissue 
biopsy. This consensus sequence is designated herein as DNA52780. 

15 In light of an observed sequence homology between the DNA52780 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3852910, the Incyte EST clone 3852910 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 267 and is herein designated as DNA60622-1525. 

The full length DNA60622-1525 clone shown in Figure 267 (SEQ ID NO:372) contained a single open 

20 reading frame with an apparent translational initiation site at nucleotide positions 54 to 56 and ending at the stop 
codon found at nucleotide positions 741 to 743. The predicted polypeptide precursor, which is shown in Figure 
268 (SEQ ID NO: 373) , is 229 amino acids long. PRO 11 98 has a calculated molecular weight of approximately 
25,764 daltons and an estimated pi of approximately 9.17. There is a signal peptide sequence at about amino 
acids 1 through 34. There is sequence identity with glycosyl hydrolases family 31 protein at about amino acids 

25 142 to about 175. 

An analysis of the Dayhoff database (version 35,45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 268 (SEQ ID NO: 373), revealed some homology 
between the PRO 1 198 amino acid sequence and the following Dayhoff sequences: ATF6H 1 1_6, UCRI RAT, 
TOBSUP2NT 1, RCUERF3J, AMU88186_1, PJW22485, S56579, AF040711J, DPP4PIG. 
30 Clone DNA60622-1525 was been deposited with the ATCC on August 4, 1998, and is assigned ATCC 

deposit no. 203090. 

EXAMPLE 120 : Isolation of cDNA clones Encoding Human PRO 1 158 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
35 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
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homology search was performed using the computer program BLAST or BLAST2 (Aitshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DN A sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57248. 
5 In light of an observed sequence homology between the DNA57248 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 2640776, the Incyte EST clone 2640776 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 269 and is herein designated as DNA60625-1507. 

The full length clone shown in Figure 269 contained a single open reading frame with an apparent 

10 translational initiation site at nucleotide positions 163 to 165 and ending at the stop codon found at nucleotide 
positions 532 to 534 (Figure 269; SEQ ID NO:374). The predicted polypeptide precursor (Figure 270, SEQ 
ID NO:375) is 123 amino acids long. PR01158 has a calculated molecular weight of approximately 13,113 
daltons and an estimated pi of approximately 8.53. Additional features include a signal peptide sequence at about 
amino acids 1-19, a transmembrane domain at about amino acids 56-80, and a potential N-glycosylation site at 

15 about amino acids 36-39. Clone DNA60625-1507 was deposited with the ATCC on June 16, 1998 and is 
assigned ATCC deposit no. 209975. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 270 (SEQ ID NO; 375), revealed some homology 
between the PROH58 amino acid sequence and the following Dayhoff sequences: ATAC00310510F18A8.10, 

20 P_R85151, PHS2 SOLTU, RNMHCIBACJ, RNA1FMHC I, 168771, RNRTIA10G_1, PTPAJHfUMAN, 
HUMGACA l, and CHKPTPA_1 . 

EXAMPLE 121 : Isolation of cDNA clones Encoding Human PROl 159 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

25 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, C A) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Aitshul et al., Methods in 
Enzvmology 266:460-480 (1996V*. Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

30 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA5722 1 . 

In light of an observed sequence homology between the DNA57221 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 376776, the Incyte EST clone 376776 was purchased 

35 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 271 and is herein designated as DNA60627-1508. 
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Clone DNA60627-1508 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 92-94 and ending at the stop codon at nucleotide positions 362-364 (Figure 27 1), The 
predicted polypeptide precursor is 90 amino acids long (Figure 272). The full-length PROl 159 protein shown 
in Figure 272 has an estimated molecular weight of about 9,840 daltons and a pi of about 10. 13. Analysis of 
the full-length PROl 159 sequence shown in Figure 272 (SEQ ID NO:377) evidences the presence of the 
5 following: a signal peptide from about amino acid 1 to about amino acid 15 and a potential N-glycosylation site 
from about amino acid 38 to about amino acid 41. Clone DNA60627-1508 has been deposited with ATCC on 
August 4, 1998 and is assigned ATCC deposit no. 203092. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 272 (SEQ ID NO: 377), evidenced significant 
10 homology between the PROl 159 amino acid sequence and the following Dayhoff sequences: AF016494 6, 
AF036708_20, DSSCUTEJ, D89100J , S28060, MEFAXENLA, AF020798_ 12, G70065 , E64423 , JQ2005 . 

EXAMPLE 122 : Isolation of cDNA clones Encoding Human PRO 11 24 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

15 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

20 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56035. 

In light of an observed sequence homology between the DNA56035 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2767646, the Incyte EST clone 2767646 wa purchased 

25 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 273 and is herein designated as DNA60629-I481 . 

The full length clone shown in Figure 273 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 25-27 and ending at the stop codon found at nucleotide positions 
2782-2784 (Figure 273; SEQ ID NO:378). The predicted polypeptide precursor (Figure 274, SEQ TO NO:379) 

30 is 919 amino acids long. PROl 124 has a calculated molecular weight of approximately 101,282 daltons and an 
estimated pi of approximately 5.37. Clone DNA60629-1481 has been deposited with the ATCC and is assigned 
ATCC deposit no. 209979. It is understood that the deposited clone has the actual sequence, whereas only 
representations based on current sequencing techniques which may include normal and minor errors, are 
provided herein. 

35 Based on a WU-BLAST2 sequence alignment analysis of the full-length sequence, PROl 124 shows 

significant amino acid sequence identity to a chloride channel protein and to ECAM-1. Specifically, the 
following Dayhoff designations were identified as having sequence identity with PROl 124: ECLCJBOVIN, 
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AF00I261J, P_W06548, SSC6A10J, AF004355_1, S76691, AF017642, BYU06866„2, CSA_DICDI and 
SAU47139_2. 

EXAMPLE 123 : Isolation of cDNA clones Encoding Human PRO 1287 

An expressed sequence tag (EST) DNA database (LIFESEQ®, lncyte Pharmaceuticals, Palo Alto, CA) 
M 5 was searched and an EST was identified which showed homology to the fringe protein. This EST sequence was 

then compared to various EST databases including public EST databases (e.g., GenBank), and a proprietary EST 

f| database (LIFESEQ®, lncyte Pharmaceuticals, Palo Alto, CA) to identify homologous EST sequences. The 

M 

■§ comparison was performed using the computer program BLAST or BLAST2 [Altschul et al., Methods in 

Enzyroology, 266:460-480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in some cases, 
10 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence 
with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). This consensus 
™* sequence obtained is herein designated DNA40568. 

Based on the DNA40568 consensus sequence, oligonucleotides were synthesized: 1 ) to identify by PCR 
a cDNA library that contained the sequence zl interest, and 2) for use as probes to isolate a clone of the full- 
15 length coding sequence for PRO 1287. Forward and reverse PCR primers generally range from 20 to 30 
nucleotides and are often designed to give a PCR product of about 1 00-1000 bp in length. The probe sequences 
^ are typically 40-55 bp in length. In some cases, additional oligonucleotides are synthesized when the consensus 

sequence is greater than about 1-1 .5kbp. In order to screen several libraries for a full-length clone, DNA from 
the libraries was screened by PCR amplification, as per Ausubel et al. , Current Protocols in Molecular Biology, 
20 supra, with the PCR primer pair. A positive library was then used to isolate clones encoding the gene of interest 
using the probe oligonucleotide and one of the primer pairs. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5'-CTCGGGGAAAGGGACTTGATGTTGG-3' (SEQ ID NO:382) 
reverse PCR primer 1 5 ' -OCG A AGGTG AGCCTCT ATCTCGTGCC-3 ' (SEQ ID NO:383) 
25 reverse PCR primer 2 5 ' -C AGCCTAC ACGT ATTG AGG-3 ' (SEQ ID NO:384) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40568 
sequence which had the following nucleotide sequence 
hybridization probe 

^ 5 ' -C AGTC AGTAC A ATCCTGGCATAAT AT ACGGCC ACC ATG ATGC AGTCCC-3 ' (SEQ ID NO:385). 

30 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

3 screened by PCR amplification with the PCR primer pairs identified above. A positive library was then used 

to isolate clones encoding the PRO 1287 gene using the probe oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA libraries was isolated from human bone marrow tissue. The cDNA 
libraries used to isolated the cDNA clones were constructed by standard methods using commercially available 
35 reagents such as those from Invitrogen, San Diego, CA. The cDN A was primed with oligo dT containing a NotI 
site 7 linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel 
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pK T Q$ or pRKD; 
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pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al. , Science . 253 : 1278-1280 
(1991)) in the unique Xhol and NotI sites. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR01287 (designated herein as DNA61755-1554 [Figure 275, SEQ ID NO:380]) and the derived protein 
sequence for PRO 1287. 

5 The entire nucleotide sequence of DNA61755-1554 is shown in Figure 275 (SEQ ID NO:380). The 

full length clone contained a single open reading frame with an apparent translational initiation site at nucleotide 
positions 655-657 and a stop signal at nucleotide positions 2251-2253 (Figure 275, SEQ ID NO:380). The 
predicted polypeptide precursor is 532 amino acids long, has a calculated molecular weight of approximately 
61 ,351 daltons and an estimated pi of approximately 8.77. Analysis of the full-length PRO 1287 sequence shown 

10 in Figure 276 (SEQ ID NO: 381) evidences the presence of the following; a signal peptide from about amino acid 
1 to about amino acid 27 and potential N-glycosylation sites from about amino acid 315 to about amino acid 318 
and from about amino acid 324 to about amino acid 327. Clone DNA61755-1554 has been deposited with 
ATCC on August 11, 1998 and is assigned ATCC deposit no. 2031 12. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

15 alignment analysis of the full-length sequence shown in Figure 276 (SEQ ID NO:381), evidenced significant 
homology between the PR01287 amino acid sequence and the following Dayhoff sequences: CET24D1_J, 
EZRIJSOVIN, GGU19889J, CC3_YEAST, S74244, NALS MOUSE, MOES PIG, S28660, S44860 and 
YNA4_CAEEL. 

20 EXAMPLE 124 : Isolation of cDNA clones Encoding Human PR01312 

DNA55773 was identified in a human fetal kidney cDNA library using a yeast screen that preferentially 
represents the 5' ends of the primary cDNA clones. Based on the DNA55773 sequence, oligonucleotides were 
synthesized for use as probes to isolate a clone of the full-length coding sequence for PR01312. 

The full length DNA61873-1574 clone shown in Figure 277 (SEQ ID NO:386) contained a single open 

25 reading frame with an apparent translational initiation site at nucleotide positions 7-9 and ending at the stop 
codon found at nucleotide positions 643-645. The predicted polypeptide precursor is 212 amino acids long 
(Figure 278, SEQ ID NO:387). PR01312 has a calculated molecular weight of approximately 24,024 daltons 
and an estimated pi of approximately 6.26. Other features include a signal peptide at about amino acids 1-14; 
a transmembrane domain at about amino acids 141-160 and potential N-glycosylation sites at about amino acids 

30 76-79 and 93-96. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 278 (SEQ ID N0:387), revealed some homology 
between the PR01312 amino acid sequence and the following Dayhoff sequences: GCINTALPHJ, 
GIBMUC1A 1, P R96298, AF001406_1, PVU88874J, P_R85i51, AF041409 1, CELC50F2J7, C45875, 
35 and AB009S10_21. 

Clone DNA61873-1574 has been deposited with ATCC and is assigned ATCC deposit no. 203132. 
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EXAMPLE 125 : Isolation of cDNA clones Encoding Human PROl 192 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is designated herein DNA35924. Based on the DNA35924 
consensus sequence t oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
m 5 PROI192. 

PCR primers (forward and reverse) were synthesized: 
;| forward PCR primer : 5' -CCG AGGCC ATCTAG AGGCC AG AGC-3 ' (SEQ ID NO:390) 

;g reverse PCR primer : 5 ' - AC AGGC AGAGCC AATGGCC AG AGC- 3 ' (S £Q ID NO:391). 

Additionally, a synthetic oligonucleotide hybridization probe .vas constructed from the consensus 
10 DNA35924 sequence which had the following nucleotide sequence: 
hybridization probe: 

5'-GAGAGGACTGCGGGAGTTTGGGACCTTTGTGCAGACGTGCTCATG-3' (SEQ ID NO:392). 

In order to screen several libraries for a source of a full length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
15 isolate clones encoding the PROl 192 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal liver and spleen tissue. 
H DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

**f PROl 192 designated herein as DNA62814-152I and shown in Figure 279 (SEQ ID NO:388); and the derived 

protein sequence for PROl 192 which is shown in Figure 280 (SEQ ID NO:389). 
20 The entire coding sequence of PROl 192 is shown in Figure 279 (SEQ ID NO:388). Clone DNA62814- 

1521 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 
121-123 and an apparent stop codon at nucleotide positions 766-768 . The predicted polypeptide precursor is 2 15 
amino acids long. The predicted polypeptide precursor has the following features: a signal peptide at about 
amino acids 1-21; a transmembrane domain at about amino acids 153-176; potential N-glycosylation sites at 
25 about amino acids 39-42 and 1 18-121; and homology with myelin P0 proteins at about amino acids 27-68 and 
99-128 of Figure 280. The full-length PROl 192 protein shown in Figure 280 has an estimated molecular weight 
of about 24 , 484 daltons and a pi of about 6.98. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
^ alignment analysis of the full-length sequence shown in Figure 280 (SEQ ID NO: 389), revealed homology 

30 between the PROl 192 amino acid sequence and the following Dayhoff sequences: GEN 12838, MYPOJHUMAN, 
j AF049498J, GEN 14531, P W14146, HS46KDA 1, CINB RAT, OX2G_RAT, D87018J, and D86996_2. 

■ Clone DNA62814-1521 was deposited with the ATCC on August 4, 1998, and is assigned ATCC 

deposit no. 203093. 

35 
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EXAMPLE 126: Isolation of cDNA clones Encoding Human PRO 11 60 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above This consensus sequence is herein designated DNA40650. Based on the DNA40650 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
5 PRO1160. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5'-GCTCCCTGATCTTCATGTCACCACC-3' (SEQ ID NO:395) 
reverse PCR primer 5 ' -C AGGG AC AC ACTCTACC ATTCGGG AG-3 ' (SEQ ID NO:396) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40650 
10 sequence which had the following nucleotide sequence 
hybridization probe 

5'-CCATCTTTCTGGTCTCTGCCCAGAATCCGACAACAGCTGCTC-3' (SEQ ID NO:397) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
15 isolate clones encoding the PROl 160 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human breast tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PROl 160 (designated herein as DNA62872-1509 [Figure 281, SEQ ID NO: 3931) and the derived protein 
sequence for PROl 160. 

20 The entire nucleotide sequence of DNA62872-I509 is shown in Figure 281 (SEQ ID NO:393). Clone 

DNA62872-I509 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 40-42 and ending at the stop codon at nucleotide positions 310-312 (Figure 281). The predicted 
polypeptide precursor is 90 amino acids long (Figure 282). The full-length PROl 160 protein shown in Figure 
282 has an estimated molecular weight of about 9,039 daltons and a pi of about 4.37. Analysis of the full-length 

25 PROl 160 sequence shown in Figure 282 (SEQ ID NO:394) evidences the presence of the following: a signal 
peptide from about amino acid 1 to about amino acid 19 and a protein kinase C phosphorylation site from about 
amino acid 68 to about amino acid 70. Clone DNA62872-1509 has been deposited with ATCC on August 4, 
1998 and is assigned ATCC deposit no. 203100. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

30 alignment analysis of the full-length sequence shown in Figure 282 (SEQ ID NO:394), evidenced significant 
homology between the PROl 160 amino acid sequence and the following Dayhoff sequences: B3O305, 
GEN13490, 153641, S53363, HA34 BRELC, SP96JDICDI, S36326, SSU51197J0, MUCIJCENLA, 
TCU32448J and AF000409J. 

35 EXAMPLE 127 : Isolation of cDNA clones Encoding Human PROl 187 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
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expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
5 the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57726. 

In light of an observed sequence homology between the DNA57726 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 358563, the Incyte EST clone 358563 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

10 The sequence of this cDNA insert is shown in Figure 283 and is herein designated as DNA62876-1517. 

The full length clone shown in Figure 283 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 121-123 and ending at the stop codon found at nucleotide 
positions 481-483 (Figure 283; SEQ ID NO:398). The predicted polypeptide precursor (Figure 284, SEQ ID 
NO: 399) is 120 amino acids long. The signal peptide is at about amino acids 1-17 of SEQ ID NO: 399. 

15 PRO 11 87 has a calculated molecular weight of approximately 12,925 daltons and an estimated pi of 
approximately 9.46. Clone DNA62876-1517 was deposited with the ATCC on August 4, 1998 and is assigned 
ATCC deposit no. 203095. It is understood that the deposited clone contains the actual sequence and that the 
representations herein may have minor sequencing errors. 

An analysis of the Dayhoff database (version 35.45 Swiss Prot 35), using a WU-BLAST2 sequence 

20 alignment analysis of the full-length sequence shown in Figure 284 (SEQ ID NO: 399), revealed some sequence 
identity (and therefore some relation) between the PROl 187 amino acid sequence and the following Dayhoff 
sequences: MGNENDOBX 1 , CELF41G3 9, AMPG STRLI, HSBBOVHERL 2, LEEXTEN101, 
AF029958 1 and P W04957. 



25 EXAMPLE 128 : Isolation of cDNA clones Encoding Human PROl 185 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

30 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56426. 

35 In light of an observed sequence homology between the DNA56426 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 32844 11 , the Incyte EST clone 328441 1 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
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The sequence of this cDNA insert is shown in Figure 285 and is herein designated as DNA62881-1515. 

The full length DNA6288 1-1515 clone shown in Figure 285 contained a single open reading frame with 
an apparent translational initiation site at nucleotide positions 4-6 and ending at the stop codon found at 
nucleotide positions 598-600 (Figure 285; SEQ 10 NO:400). The predicted polypeptide precursor (Figure 286, 
SEQ ID NO:401) is 198 amino acids long. The signal peptide is at about amino acids 1-21 of SEQ ID NO:401. 
5 PRO 11 85 has a calculated molecular weight of approximately 22,105 daltons and an estimated pi of 
approximately 7.73. Clone DNA62881-1515 has been deposited with the ATCC and is assigned ATCC deposit 
no. 203096. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 286 (SEQ ID NO:401), revealed some sequence 
10 identity between the PRO 11 85 amino acid sequence and the following Dayhoff sequences: TUP1YEAST, 
AF041382_1, MAOM SOLTU, SPPBPHU9_ 1,14 1024, EPCPLCFAILJ, HSPLECJ, YKL4 CAEEL, 
A44643, TGU65922_1. 

EXAMPLE 129 : Isolation of cDNA clones Encoding Human PRO 1345 
15 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 

in Example 1 above. This consensus sequence is herein designated DNA47364. Based on the DNA47364 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR01345. 

20 PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5'-CCTGGTTATCCCCAGGAACTCCGAC-3' (SEQ ID NO:404) 
reverse PCR primer 5 ' -CTCTTGCTGCTGCG AC AGGCCTC -3 ' (SEQ ID NO:405) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA47364 
sequence which had the following nucleotide sequence 
25 hybridization probe 

5-CGCCCTCCAAGACTATGGTAAAAGGAGCCTGCCAGGTGTCAATGAC-3' (SEQ ID NO:406) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 

isolate clones encoding the PRO 1345 gene using the probe oligonucleotide and one of the PCR primers. RNA 
30 for construction of the cDNA libraries was isolated from human breast carcinoma tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PR01345 (designated herein as DNA64852-1589 [Figure 287, SEQ ID NO:402J) and the derived protein 

sequence for PRO 1345. 

The entire nucleotide sequence of DNA64852-1589 is shown in Figure 287 (SEQ ID NO:402). Clone 
35 DNA64852-1589 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 7-9 or 34-36 and ending at the stop codon at nucleotide positions 625-627 (Figure 287). The predicted 
polypeptide precursor is 206 amino acids long (Figure 288). The full-length PR01345 protein shown in Figure 
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288 has an estimated molecular weight of about 23, 190 daltons and a pi of about 9.40. Analysis of the full- 
length PR01345 sequence shown in Figure 288 (SEQ ID NO:403) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 31 or from about amino acid 10 to about amino acid 
31 and a C-type lectin domain signature sequence from about amino acid 176 to about amino acid 190. Clone 
DNA64852-1589 has been deposited with ATCC on August 18,1 998 and is assigned ATCC deposit no. 203 127. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 288 (SEQ ID NO: 403), evidenced significant 
homology between the PR01345 amino acid sequence and the following Dayhoff sequences: BTU22298 1, 
TETN_CARSP, TETNJE-IUMAN, MABARAT, S34198, P_W13144, MACMBPAJ, A46274, PSPD RAT 
AND P R32188. 



EXAMPLE 130 : Isolation of cDNA clones Encoding Human PRO 1245 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 

15 EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

20 obtained therefrom is herein designated DNA56019. 

In light of an observed sequence homology between the DNA56019 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 1327836, the Incyte EST clone 1327836 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 289 and is herein designated as DNA64884-1527. 

25 The full length clone shown in Figure 289 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 79-81 and ending at the stop codon found at nucleotide positions 
391-393 (Figure 289; SEQ ID NO:407). The predicted polypeptide precursor (Figure 290, SEQ ID NO:408) 
is 104 amino acids long, with a signal peptide sequence at about amino acid 1 to about amino acid 18. 
PRO 1245 has a calculated molecular weight of approximately 10,100 daltons and an estimated pi of 

30 approximately 8.76. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 290 (SEQ ID NO:408), revealed some homology 
between the PRO 1245 amino acid sequence and the following Dayhoff sequences: SYAJTHETH, GEN1 11 67, 
MTV044_4, AB011151_1, RLAJ2750_3, SNELIPTRAJ, S63624, C28391, A37907, and S14064. 

35 Clone DNA64884-1245 was deposited with the ATCC on August 25, 1998 and is assigned ATCC 

deposit no. 203155. 
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EXAMPLE 131 : Isolation of cDNA clones Encoding Human PRO 1358 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
5 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
10 encompassed within the Incyte EST clone no. 88718, the Incyte EST clone 887 18 was purchased and the cDNA 
insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The sequence 
of this cDNA insert is shown in Figure 291 and is herein designated as DNA64890-1612. 

The full length clone shown in Figure 291 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 86 through 88 and ending at the stop codon found at nucleotide 
15 positions 1418 through 1420 (Figure 291; SEQ ID NO:409), The predicted polypeptide precursor (Figure 292, 
SEQ ID NO:410) is 444 amino acids long. The signal peptide is at about amino acids 1-18 of SEQ ID NO:410. 
PRO 1358 has a calculated molecular weight of approximately 50,719 daitons and an estimated pi of 
approximately 8.82. Clone DNA64890-1612 was deposited with the ATCC on August 18, 1998 and is assigned 
ATCC deposit no. 203131, 

20 An analysis of the Dayhoff database (version 35,45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 292 (SEQ ID NO :4 10), revealed sequence identity 
between the PROI358 amino acid sequence and the following Dayhoff sequences: P_W07607, AB000545_1, 
AB000546J, A1ATJIAT, AB015164_1,P_P50021, COTRCAVPO, and HAMHPP 1. The variants claimed 
in this application exclude these sequences. 

25 

EXAMPLE 132 : Isolation of cDNA clones Encoding Human PROH95 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 

30 EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies- The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et aL, Methods in 
Enzvmology 266:460-480 (1996^). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

35 obtained therefrom is herein designated DNA55716. 

In light of an observed sequence homology between the DNA55716 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3252980, the Incyte EST clone 3252980 was purchased 
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and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 293 and is herein designated as DNA654 12-1523. 

The full length clone shown in Figure 293 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 58-60 and ending at the stop codon found at nucleotide positions 
511-513 (Figure 293; SEQ ID NO:411). The predicted polypeptide precursor (Figure 294, SEQ ID NO:412) 
5 is 151 amino acids long. The signal sequence is at about amino acids 1-22 of SEQ ID NO:412. PROl 195 has 
a calculated molecular weight of approximately 17,277 daltons and an estimated pi of approximately 5.33. Clone 
DNA65412-1523 was deposited with the ATCC on August 4, 1998 and is assigned ATCC deposit no. 203094. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 294 (SEQ ID NO:412), revealed some sequence 
10 identity between the PROl 195 amino acid sequence and the following Dayhoff sequences: MMU28486 1, 
AF044205_1, P W31186, CELK03C7J, F69034, EF1A METVA, AF024540J, SSU90353 1, 
MRSP STAAU and P R97680. 

EXAMPLE 133 : Isolation of cDNA clones Encoding Human PRO 1270 

15 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 

20 Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DN A5795 1 . 

In light of an observed sequence homology between the DN A5795 1 consensus sequence and an EST 

25 sequence encompassed within the Merck EST clone no. 124878, the Merck EST clone 124878 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 295 and is herein designated as DNA66308-1537. 

Clone DNA66308-1537 contains a single open reading frame with an apparent translational initiation 
site at nucleotide position 103-105 and ending at the stop codon at nucleotide positions 1042-1044 (Figure 295). 

30 The predicted polypeptide precursor is 313 amino acids long (Figure 296), The full-length PRO 1270 protein 
shown in Figure 296 has an estimated molecular weight of about 34,978 daltons and a pi of about 5 J 1 . Analysis 
of the full-length PROl 270 sequence shown in Figure 296 (SEQ ID NO:414) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 16, a potential N-glycosylation site from 
about amino acid 163 to about amino acid 166 and glycosaminoglycan attachment sites from about amino acid 

35 74 to about amino acid 77 and from about amino acid 289 to about amino acid 292. Clone DNA66308-1537 has 
been deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no, 203159. 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 296 (SEQ ID NO:414), evidenced significant 
homology between the PRO 1270 amino acid sequence and the following Dayhoff sequences: XLU86699_1, 
S49589, FIBAPARPA, FIBBHUM AN, PJR47 189, AF004326 1 1 DRTEN ASCN_1 , AF004327_1 , P_W0141 1 
and FIBG_BOVIN. 

EXAMPLE 134 : Isolation of cDNA clones Encoding Human PRO 1271 
■§< Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

H EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 

expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
10 EST DNA database (LIFESEQ® Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et aL, Methods in 
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
15 obtained therefrom is herein designated DNA57955. 

In light of an observed sequence homology between the DNA57955 consensus sequence and an EST 
i| sequence encompassed within the Merck EST clone no, AA625350, the Merck EST clone AA625350 was 

^ purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 

protein. The sequence of this cDNA insert is shown in Figure 297 and is herein designated as DNA66309-1538. 
20 Clone DNA66309-1538 contains a single open reading frame with an apparent translational initiation 

site at nucleotide positions 94-96 and ending at the stop codon at nucleotide positions 718-720 (Figure 297). The 
predicted polypeptide precursor is 208 amino acids long (Figure 298). The full-length PRO 1271 protein shown 
in Figure 298 has an estimated molecular weight of about 21 ,53 1 daltons and a pi of about 8.99. Analysis of 
the full-length PRO 1271 sequence shown in Figure 298 (SEQ ID NO:416) evidences the presence of the 
25 following: a signal peptide from about amino acid 1 to about amino acid 31 and a transmembrane domain from 
about amino acid 166 to about amino acid 187. Clone DNA66309-1538 has been deposited with ATCC on 
September 15, 1998 and is assigned ATCC deposit no. 203235. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
$ alignment analysis of the full-length sequence shown in Figure 298 (SEQ ID NO:416), evidenced significant 

30 homology between the PRO 1271 amino acid sequence and the following Dayhoff sequences: S57180, S63257, 
AGA1YEAST, BPU43599 1, YS8A CAEEL, S67570, LSU54556 2, S70305, VGLX HSVEB, and 
- D88733J. 

EXAMPLE 135 : Isolation of cDNA clones Encoding Human PRQ1375 
35 A Merck/ Wash. U. database was searched and a Merck EST was identified. This sequence was then 

put in a program which aligns it with other seequences from the Swiss-Prot public database, public EST 
V- databases (e.g., GenBank, Merck/Wash. U.), and a proprietary EST database (LIFESEQ®, Incyte 
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Pharmaceuticals, Palo Alto, CA). The search was performed using the computer program BLAST or BLAST2 
[Altschul et al., Methods in Enzvmoiogy. 266:460-480 (1996)] as a comparison of the extracellular domain 
(ECD) protein sequences to a 6 frame translation of the EST sequences. Those comparisons resulting in a 
BLAST score of 70 (or in some cases, 90) or greater that did not encode known proteins were clustered and 
assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, 
; ff 5 Seattle, Washington). 

A consensus DNA sequence was assembled relative to other EST sequences using phrap. This 
H consensus sequence is designated herein "DNA67003", 

Based on theDNA67003 consensus sequence, the nucleic acid (SEQ ID NO:4I7) was identified in a 
human pancreas library. DNA sequencing of the clone gave the full-length DNA sequence for PRO 1375 and 
10 the derived protein sequence for PRO 1375. 

The entire coding sequence of PR01375 is shown in Figure 299 (SEQ ID NO:4 17). Clone DNA67004- 
1614 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 
104-106 and an apparent stop codon at nucleotide positions 698-700 of SEQ ID NO:417. The predicted 
polypeptide precursor is 198 amino acids long. The transmembrane domains are at about amino acids 11-28 
15 (type II) and 103-125 of SEQ ID NO:418. Clone DNA67004-1614 has been deposited with ATCC and is 
assigned ATCC deposit no. 2031 15. The full-length PRO 1375 protein shown in Figure 300 has an estimated 
molecular weight of about 22,531 daltons and a pi of about 8.47. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 300 (SEQ ID NO:4 1 8), revealed sequence identity 
20 between the PRO 1 375 amino acid sequence and the following Dayhoff sequences : AF026 1 98 5 , CELR 1 2C 1 2 5 , 
S73465, Y011MYCPN, S64538_l, P_P8150, MUVSHPO10 L VSHJvfUMPL and CVU59751 5. 

EXAMPLE 136 : Isolation of cDNA clones Encoding Human PRO 1385 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
25 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. f GenBank) and a proprietary 
EST DNA database (LIFESEQ® Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmoiogy 266:460 4g? (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
30 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap*' (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57952. 

In light of an observed sequence homology between the DNA57952 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3129630, the Incyte EST clone 3129630 was purchased 
35 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 301 and is herein designated as DNA68869-1610. 
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Clone DNA68869-1610 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 26-28 and ending at the stop codon at nucleotide positions 410-412 (Figure 301). The 
predicted polypeptide precursor is 128 amino acids long (Figure 302), The full-length PRO 1385 protein shown 
in Figure 302 has an estimated molecular weight of about 13,663 daltons and a pi of about 10.97. Analysis of 
the full-length PRO 1385 sequence shown in Figure 302 (SEQ ID NO:420) evidences the presence of the 
5 following: a signal peptide from about amino acid 1 to about amino acid 28 , and glycosylaminoglycan attachment 
sites from about amino acid 82 to about amino acid 85 and from about amino acid 91 to about amino acid 94. 
Clone DNA68869-1610 has been deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no. 
203164. 

An analysis of the Dayhoff database (version 35.45 Swiss Prot 35), using a WU-BLAST2 sequence 
10 alignment analysis of the full-length sequence shown in Figure 302 (SEQ ID NO: 420), evidenced low homology 
between the PR01385 amino acid sequence and the following Dayhoff sequences: CELT14A8_1, 
LMNACHRA1 1, HXD9_HUMAN, CHKCMLF l, HS5PP34 2, DMDRING 1, A37107 1, 
MMLUNGENEJ, PUMDROME and DMU251 17 J . 



15 EXAMPLE 137 : Isolation of cDNA clones Encoding Human PRO 1 387 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, C A) to identify existing homologies. The 

20 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is hcjz'm designated DNA56259. 

25 In light of an observed sequence homology between the DNA56259 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3507924, the Incyte EST clone 3507924 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 303 and is herein designated as DNA68872-1620. 

Clone DNA688 7 2- 1620 contains a single open reading frame with an apparent translational initiation 

30 site at nucleotide positions 85-87 and ending at the stop codon at nucleotide positions 1267-1269 (Figure 303). 
The predicted polypeptide precursor is 394 amino acids long (Figure 304). The full-length PRO 1387 protein 
shown in Figure 304 has an estimated molecular weight of about 44 ,339 daltons and a pi of about 7.10. Analysis 
of the full-length PRO 13 87 sequence shown in Figure 304 (SEQ ID NO: 422) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 19, a transmembrane domain from about 

35 amino acid 275 to about amino acid 296, potential N-glycosylation sites from about amino acid 76 to about amino 
acid 79, from about amino acid 231 to about amino acid 234, from about amino acid 302 to about amino acid 
305, from about amino acid 307 to about amino acid 310 and from about amino acid 376 to alout amino acid 
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379, and amino acid sequence blocks having homology to myelin pO protein from about amino acid 210 to about 
amino acid 239 and from about amino acid 92 to about amino acid 121. Clone DNA68872-1620 has been 
deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no. 203160. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 304 (SEQ ID NO: 422), evidenced significant 
5 homology between the PRO 1387 amino acid sequence and the following Dayhoff sequences: P_W36955, 
MYP0HETFR, HS46KDA 1, AF049498 i, MYO0_HUMAN, AF030454J, A53268, SHPTCRA 1, 
PW14146 and GEN 1 2838. 

EXAMPLE 138 : Isolation of cDNA clones Encoding Human PRO 1384 
10 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 

in Example 1 above. This consensus sequence is herein designated DNA54192. Based on the DNA54192 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 
of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO 1384. 

PCR primers (forward and reverse) were synthesized: 
15 forward PCR primer 5'-TGCAGCCCCTGTGACACAAACTGG-3' (SEQ ID NO:425) 
reverse PCR primer 5'-CTGAGATAACCGAGCCATCCTCCCAC -3' (SEQ ID NO:426) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA54192 
sequence which had the following nucleotide sequence: 
hybridization probe 

20 5 ' -GGAG ATAGCTGCT ATGGGTTCTTC AGGC AC AACTT A AC ATGGG AAG-3 * (SEQ ID NO:427) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 

isolate clones encoding the PRO 13 84 gene using the probe oligonucleotide and one of the PCR primers. RNA 

for construction of the cDNA libraries was isolated from human fetal liver. 
25 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PR01384 (designated herein as DNA71159-1617 [Figure 305, SEQ ID NO:423]; and the derived protein 

sequence for PRO 1384. 

The entire coding sequence of PR01384 is shown in Figure 305 (SEQ ID NO:423). Clone DNA7 i 159- 
1617 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 

30 1 82- 1 84 and an apparent stop codon at nucleotide positions 869-87 1 . The predicted polypeptide precursor is 229 
amino acids long. The full-length PRO 1384 protein shown in Figure 306 has an estimated molecular weight of 
about 26,650 daltons and a pi of about 8.76. Additional features include a type II transmembrane domain at 
about amino acids 32-57, and potential N-glycosylation sites at about amino acids 68-71 , 120-123, and 134-137, 
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

35 alignment analysis of the full-length sequence shown in Figure 306 (SEQ ID NO: 424), revealed homology 
between the PR01384 amino acid sequence and the following Dayhoff sequences: AF054819_1, HSAJ1687_1, 
AF009511J,ABOlO71O_l,GEN13595,HSAJ673 1, GEN 1 396 1 , AB005900 1, LECH CHICK, AF021 349 1, 
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and NK13RAT. 

Clone DNA71 159-1617 has been deposited with ATCC and is assigned ATCC deposit no. 203135. 

EXAMPLE 139 : Use of PRO as a hybridization probe 

The following method describes use of a r»'Heotide sequence encoding PRO as a hybridization probe. 
5 DNA comprising the coding sequence of full-length or mature PRO as disclosed herein is employed as 

a probe to screen for homologous DN As (such as those encoding naturally-occurring variants of PRO) in human 
tissue cDNA libraries or human tissue genomic libraries. 

Hybridization and washing of filters containing either library ONAs is performed under the following 
high stringency conditions. Hybridization of radiolabeled PRO-derived probe to the filters is performed in a 
10 solution of 50% formamide, 5x SSC, 0. 1 % SDS, 0. 1 % sodium pyrophosphate, 50 rnM sodium phosphate, pH 
6.8, 2x Denhardt's solution, and 10% dextran sulfate at 42°C for 20 hours. Washing of the filters is performed 
in an aqueous solution of 0. lx SSC and 0. 1 % SDS at 42°C. 

DN As having a desired sequence identity with the DNA encoding full-length native sequence PRO can 
then be identified using standard techniques known in the art. 

15 

EXAMPLE 140 : Expression of PRO in E. coli 

This example illustrates preparation of an unglycosylated form of PRO by recombinant expression in 

E. coli. 

The DNA sequence encoding PRO is initially amplified using selected PCR primers. The primers 
20 should contain restriction enzyme sites which correspond to the restriction enzyme sites on the selected 
expression vector. A variety of expression vectors may be employed. An example of a suitable vector is 
pBR322 (derived from E. coli; see Bolivar et al., Gene, 2:95 (1977)) which contains genes for ampicillin and 
tetracycline resistance. The vector is digested with restriction enzyme and dephosphorylated. The PCR 
amplified sequences are then ligated into the vector. The vector will preferably include sequences which encode 
25 for an antibiotic resistance gene, a tip promoter, a polyhis leader (including the first six STII codons, polyhis 
sequence, and enterokinase cleavage site), the PRO coding region, lambda transcriptional terminator, and an 
argU gene. 

The ligation mixture is then used to transform a selected E. coli strain using the methods described in 
Sambrook et al. , supra . Transformants are identified by their ability to grow on LB plates and anuoiotic resistant 
30 colonies are then selected. Plasmid DNA can be isolated and confirmed by restriction analysis and DNA 
sequencing. 

Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented with 
antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The cells are 
then grown to a desired optical density, during which the expression promoter is turned on. 
35 After culturing the cells for several more hours, the cells can be harvested by centrifugation. The cell 

pellet obtained by the centrifugation can be solubilized using various agents known in the art, and the solubilized 
PRO protein can then be purified using a metal chelating column under conditions that allow tight binding of the 
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protein. 

PRO may be expressed in E. coli in a poly-His tagged form, using the following procedure. The DNA 
encoding PRO is initially amplified using selected PCR primers. The primers will contain restriction enzyme 
sites which correspond to the restriction enzyme sites on the selected expression vector, and other useful 
sequences providing for efficient and reliable translation initiation, rapid purification on a metal chelation 
# 5 column, and proteolytic removal with enterokinase. The PCR-amplifted, poly-His tagged sequences are then 

ligated into an expression vector, which is used to transform an E. coli host based on strain 52 (W3110 
fuhA(tonA) Ion galE rpoHts(htpRts) clpP(lacIq). Transformants are first grown in LB containing 50 mg/ml 
^ carbenicillin at 30°C with shaking until an O.D.600 of 3-5 is reached. Cultures are then diluted 50-100 fold into 

CRAP media (prepared by mixing 3.57 g (NH 4 ) 2 S0 4 , 0.71 g sodium citrate»2H20, L07 g KC1, 5.36 g Difco 

10 yeast extract, 5.36 g Sheffieldiiycase SF in 500 mL water, as well as 1 10 mM MPOS, pH 7.3, 0.55% (w/v) 
glucose and 7 mM MgS0 4 ) and grown for approximately 20-30 hours at 30 °C with shaking. Samples are 
removed to verify expression by SDS-PAGE analysis, and the bulk culture is centrifuged to pellet the cells. Cell 
pellets are frozen until purification and refolding. 

E. coli paste from 0.5 to 1 L fermentations (6-10 g pellets) is resuspended in 10 volumes (w/v) in 7 M 

15 guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate is added to make final 
concentrations of 0. 1M and 0.02 M, respectively, and the solution is stirred overnight at 4°C. This step results 
: 0 in a denatured protein with all cysteine residues blocked by sulfitolization. The solution is centrifuged at 40,000 

rpm in a Beckman Ultracentifuge for 30 min. The supernatant is diluted with 3-5 volumes of metal chelate 
column buffer (6 M guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 micron filters to clarify. The 

20 clarified extract is loaded onto a 5 ml Qiagen Ni-NTA metal chelate column equilibrated in the metal chelate 
column buffer. The column is washed with additional buffer containing 50 mM imidazole (Calbiochem, Utrol 
grade), pH 7.4. The protein is eluted with buffer containing 250 mM imidazole. Fractions containing the 
desired protein are pooled and stored at 4°C. Protein concentration is estimated by its absorbance at 280 nm 
using the calculated extinction coefficient based on its amino acid sequence. 

25 The proteins are refolded by diluting the sample slowly into freshly prepared refolding buffer consisting 

of: 20 mM Tris, pH 8.6, 0.3 M NaCl, 2.5 M urea, 5 mM cysteine, 20 mM glycine and 1 mM EDTA. 
Refolding volumes are chosen so that the final protein concentration is between 50 to 100 micrograms/ml. The 
refolding solution is stirred gently at 4°C for 12-36 hours. The refolding reaction is quenched by the addition 
of TFA to a final concentrate^ of 0.4 % (pH of approximately 3). Before further purification of the protein, the 

30 solution is filtered through a 0.22 micron filter and acetonitrile is added to 2-10% final concentration. The 
£ refolded protein is chromatographed on a Poros Ri/H reversed phase column using a mobile buffer of 0.1% 

TFA with elution with a gradient of acetonitrile from 10 to 80% . Aliquots of fractions with A280 absorbance 
are analyzed on SDS polyacrylamide gels and fractions containing homogeneous refolded protein are pooled. 
Generally, the properly refolded species of most proteins are eluted at the lowest concentrations of acetonitrile 

35 since those species are the most compact with their hydrophobic interiors shielded from interaction with the 
reversed phase resin. Aggregated species are usually eluted at higher acetonitrile concentrations. In addition 
to resolving misfolded forms of proteins from the desired form, the reversed phase step also removes endotoxin 
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from the samples. 

Fractions containing the desired folded PRO polypeptide are pooled and the acetonitrile removed using 
a gentle stream of nitrogen directed at the solution. Proteins are formulated into 20 mM Hepes, pH 6,8 with 
0. 14 M sodium chloride and 4% mannitol by dialysis or by gel filtration using G25 Superfine (Pharmacia) resins 
equilibrated in the formulation buffer and sterile filtered. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 



$4 

>?? EXAMPLE 141 : Expression of PRO in mamma lian cells 



This example illustrates preparation of a potentially glycosylated form of PRO by recombinant 
expression in mammalian cells. 
10 The vector, pRK5 (see EP 307,247, published March 15, 1989), is employed as the expression vector. 

Optionally, the PRO DNA is ligated into pRK5 with selected restriction enzymes to allow insertion of the PRO 
DNA using ligation methods such as described in Sambrook et al., supra . The resulting vector is called pRK5- 
PRO. 

In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573) are 

15 grown to confluence in tissue culture plates in medium such as DMEM supplemented with fetal calf serum and 
optionally, nutrient components and/or antibiotics. About 10 pRK5~PRO DNA is mixed with about 1 fig 
DNA encoding the VA RNA gene [Thimmappaya et al., Cell, 31:543 (1982)) and dissolved in 500 /d of 1 mM 
Tris-HCI, 0.1 mM EDTA, 0.227 M CaCl 2 . To this mixture is added, dropwise, 500 y\ of 50 mM HEPES (pH 
7.35), 280 mM NaCI, 1.5 mM NaP0 4 , and a precipitate is allowed to form for 10 minutes at 25°C. The 

20 precipitate is suspended and added to the 293 cells and allowed to settle for about four hours at 37°C. The 
culture medium is aspirated off and 2 ml of 20% glycerol in PBS is added for 30 seconds. The 293 cells are 
then washed with serum free medium, fresh medium is added and the cells are incubated for about 5 days. 

Approximately 24 hours after the trans fections, the culture medium is removed and replaced with culture 
medium (alone) or culture medium containing 200 fiCi/m\ 35 S cysteine and 200 fiCl/ml 35 S-metnionine. After 

25 a 12 hour incubation, the conditioned medium is collected, concentrated on a spin filter, and loaded onto a 15 % 
SDS gel. The processed gel may be dried and exposed to film for a selected period of time to reveal the 
presence of PRO polypeptide. The cultures containing transfected cells may undergo further incubation (in 
serum free medium) and the medium is tested in selected bioassays. 

In an alternative technique, PRO may be introduced into 293 cells transiently using the ucxtran sulfate 

30 method described by Somparyrac et al., Proc. Natl. Acad. Sci. . 12:7575 (1981). 293 cells are grown to 
maximal density in a spinner flask and 700 fig pRK5-PRO DNA is added. The cells are first concentrated from 
the spinner flask by centrifugation and washed with PBS. The DNA-dextran precipitate is incubated on the cell 
pellet for four hours. The cells are treated with 20% glycerol for 90 seconds, washed with tissue culture 
medium, and re-introduced into the spinner flask containing tissue culture medium, 5 jug/ml bovine insulin and 

35 0. i /ig/ml bovine transferrin. After about four days, the conditioned media is centrifuged and filtered to remove 
cells and debris. The sample containing expressed PRO can then be concentrated and purified by any selected 
method, such as dialysis and/or column chromatography. 
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In another embodiment, PRO can be expressed in CHO cells. The pRK5~PRO can be transfected into 
CHO cells using known reagents such as CaP0 4 or DEAE-dextran. As described above, the cell cultures can 
be incubated, and the medium replaced with culture medium (alone) or medium containing a radiolabel such as 
35 S-methionine. After determining the presence of PRO polypeptide, the culture medium may be replaced with 
serum free medium. Preferably, the cultures are incubated for about 6 days, and then the conditioned medium 
5 is harvested. The medium containing the expressed PRO can then be concentrated and purified by any selected 
method. 

Epitope-tagged PRO may also be expressed in host CHO cells. The PRO may be subcloned out of the 
pRK5 vector. The subclone insert can undergo PCR to fuse in frame with a selected epitope tag such as a poly- 
his tag into a Baculovirus expression vector. The poly-his tagged PRO insert can then be subcloned into a SV40 

10 driven vector containing a selection marker such as DHFR for selection of stable clones. Finally, the CHO cells 
can be transfected (as described above) with the SV40 driven vector. Labeling may be performed, as described 
above, to verify expression. The culture medium containing the expressed poly-His tagged PRO can then be 
concentrated and purified by any selected method, such as by Ni :+ -chelate affinity chromatography. 

PRO may also be expressed in ChO and/or COS cells by a transient expression procedure or in CHO 

15 cells by another stable expression procedure. 

Stable expression in CHO cells is performed using the following procedure. The proteins are expressed 
as an IgG construct (immunoadhesin), in which the coding sequences for the soluble forms (e.g. extracellular 
domains) of the respective proteins are fused to an IgGl constant region sequence containing the hinge, CH2 and 
CH2 domains and/or is a poly-His tagged form. 

20 Following PCR amplification, the respective DNAs are subcloned in a CHO expression vector using 

standard techniques as described in Ausubel et al., Current Protocols of Molecular Biology, Unit 3.16, John 
Wiley and Sons (1997). CHO expression vectors are constructed to have compatible restriction sites 5' and 3' 
of the DNA of interest to allow the convenient shuttling of cDNA's. The vector used expression in CHO cells 
is as described in Lucas et al M Nucl. Acids Res. 24:9 (1774-1779 (1996), and uses the SV40 early 

25 promoter/enhancer to drive expression of the cDNA of interest and dihydrofolate reductase (DHFR). DHFR 
expression permits selection for stable maintenance of the plasmid following transfection. 

Twelve micrograms of the desired plasmid DNA is introduced into approximately 10 rJllion CHO cells 
using commercially available transfection reagents Superfect* (Quiagen), Dosper* or Fugene* (Boehringer 
Mannheim). The cells a^e grown as described in Lucas et ah, supra . Approximately 3 x 10" 7 cells are frozen 

30 in an ampule for further growth and production as described below. 

The ampules containing the plasmid DNA are thawed by placement into water bath and mixed by 
vortexing. The contents are pipetted into a centrifuge tube containing 10 mLs of media and centrifuged at 1000 
rpm for 5 minutes. The supernatant is aspirated and the cells are resuspended in 10 mL of selective media (0.2 
ixm filtered PS20 with 5% 0.2 ^m diafiltered fetal bovine serum). The cells are then aliquoted into a 100 mL 

35 spinner containing 90 mL of selective media. After 1-2 days, the cells are transferred into a 250 mL spinner 
filled with 150 mL selective growth medium and incubated at 37°C. After another 2-3 days, 250 mL, 500 mL 
and 2000 mL spinners are seeded with 3 x 10 5 cells/mL, The cell media is exchanged with fresh media by 
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centrifugation and resuspension in production medium. Although any suitable CHO media may be employed, 
a production medium described in U.S. Patent No. 5,122,469, issued June 16, 1992 may actually be used. A 
3L production spinner is seeded at 1 .2 x 10 6 cells/mL. On day 0, the cell number pH ie determined. On day 
1, the spinner is sampled and sparging with filtered air is commenced. On day 2, the spinner is sampled, the 
temperature shifted to 33°C, and 30 mL of 500 g/L glucose and 0.6 mL of 10% antifoam (e.g., 35% 
5 poiydimethylsiloxane emulsion, Dow Corning 365 Medical Grade Emulsion) taken. Throughout the production, 
the pH is adjusted as necessary to keep it at around 7.2. After 10 days, or until the viability dropped below 
70% , the cell culture is harvested by centrifugation and filtering through a 0.22 /jm filter. The filtrate was either 
stored at 4°C or immediately loaded onto columns for purification. 

For the poIy-His tagged constructs, the proteins are purified using a Ni-NTA column (Qiagen). Before 

10 purification, imidazole is added to the conditioned media to a concentration of 5 mM. The conditioned media 
is pumped onto a 6 ml Ni-NTA column equilibrated in 20 mM Hepes, pH 7.4, buffer containing 0.3 M NaCl 
and 5 mM imidazole at a flow rate of 4-5 ml/min. at 4°C. After loading, the column is washed with additional 
equilibration buffer and the protein eluted with equilibration buffer containing 0.25 M imidazole. The highly 
purified protein is subsequently desalted into a storage buffer containing 10 mM Hepes, 0. 14 M NaCl and 4% 

15 mannitol, pH 6.8, with a 25 ml G25 Superfine (Pharmacia) column and stored at -80°C, 

Immunoadhesin (Fc-containing) constructs are purified from the conditioned media as follows. The 
conditioned medium is pumped onto a 5 ml Protein A column (Pharmacia) which had been equilibrated in 20 
mM Na phosphate buffer » pH 6.8. After loading, the column is washed extensively with equilibration buffer 
before elution with 100 mM citric acid, pH 3.5. The eluted protein is immediately neutralized by collecting 1 

20 ml fractions into tubes containing 275 //L of I M Tris buffer, pH 9. The highly purified protein is subsequently 
desalted into storage buffer as described above for the poly-His tagged proteins. The homogeneity is assessed 
by SDS poly aery lamide gels and by N-terminal amino acid sequencing by Edman degradation. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 



25 EXAMPLE 142 : Expression of PRO in Yeast 

The following method describes recombinant expression of PRO in yeast. 

First, yeast expression vectors are constructed for intracellular production or secretion of PRO from 
the ADH2/GAPDH promoter. DN A encoding PRO and the promoter is inserted into suitable restriction enzyme 
sites in the selected plasmid to direct intracellular expression of PRO, For secretion, DNA encoding PRO can 

30 be cloned into the selected plasmid, together with DNA encoding the ADH2/GAPDH promoter, a native PRO 
signal peptide or other mammalian signal peptide, or, for example, a yeast alpha- factor or invertase secretory 
signal/leader sequence, and linker sequences (if needed) for expression of PRO. 

Yeast cells, such as yeast strain AB 1 10, can then be transformed with the expression plasmids described 
above and cultured in selected fermentation media. The transformed yeast supernatants can be analyzed by 

35 precipitation with 10% trichloroacetic acid and separation by SDS-PAGE, followed by staining of the gels with 
Coomassie Blue stain. 
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Recombinant PRO can subsequently be isolated and purified by removing the yeast cells from the 
fermentation medium by centrifugation and then concentrating the medium using selected cartridge filters. The 
concentrate containing PRO may further be purified using selected column chromatography resins. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

S 5 EXAMPLE 143 : Expression of PRO in Baculovirus-Infected Insect Cells 

The following method describes recombinant expression of PRO in Baculovirus- infected insect cells. 
r| The sequence coding for PRO is fused upstream of an epitope tag contained within a baculovirus 

expression vector. Such epitope tags include poly-his tags and irnmun jglobulin tags (like Fc regions of IgG). 
A variety of plasmids may be employed, including plasmids derived from commercially available plasmids such 
> 10 as pVL1393 (Novagen). Briefly, the sequence encoding PRO or the desired portion of the coding sequence of 

PRO such as the sequence encoding the extracellular domain of a transmembrane protein or the sequence 
encoding the mature protein if the protein is extracellular is amplified by PCR with primers complementary to 
the 5' and 3' regions. The 5* primer may incorporate flanking (selected) restriction enzyme sites. The product 
is then digested with those selected restriction enzymes and subcloned into the expression vector. 
15 Recombinant baculovirus is generated by co-transfecting the above plasmid and BaculoGoId™ virus 

DNA (Pharmingen) into Spodoptera frugiperda ("SffT) cells (ATCC CRL 1711) using lipofectin (commercially 
£ 4, available from GIBCO-BRL). After 4-5 days of incubation at 28°C, the released viruses are harvested and used 

for further amplifications. Viral infection and protein expression are performed as described by O'Reiliey et 
al., Baculovirus expression vectors: A Laboratory Manual, Oxford: Oxford University Press (1994). 
20 Expressed poly-his tagged PRO can then be purified, for example, by NF + -chelate affinity 

chromatography as follows. Extracts are prepared from recombinant virus-infected Sf9 cells as described by 
Rupert et al., Nature , 362 :175-179 (1993). Briefly, Sf9 cells are washed, resuspended in sonication buffer (25 
mL Hepes, pH 7.9; 12.5 mM MgCl 2 ; 0.1 mM EDTA; 10% glycerol; 0. 1 % NP-40; 0.4 M KC1), and sonicated 
twice for 20 seconds on ice. The sonicates are cleared by centrifugation, and the supernatant is diluted 50-fold 
25 in loading buffer (50 mM phosphate, 300 mM NaCl, 10% glycerol, pH 7.8) and filtered through a 0.45 jum 
filter. A Ni- + -NTA agarose column (commercially available from Qiagen) is prepared with a bed volume of 5 
mL, washed with 25 mL of water and equilibrated with 25 mL of loading buffer. The filtered cell extract is 
loaded onto the column at 0.5 mL per minute. The column is washed to baseline A 280 with loading buffer, at 
which point fraction collection is started. Next, the column is washed with a secondary wash buffer (50 mM 
30 phosphate; 300 mM NaCl, 10% glycerol, pH 6.0), which elutes nonspecifically bound protein. After reaching 
A 28 o baseline again, the column is developed with a 0 to 500 mM Imidazole gradient in the secondary wash 
buffer. One mL fractions are collected and analyzed by SDS-PAGE and silver staining or Western blot with 
Ni 2+ -NTA-conjugated to alkaline phosphatase (Qiagen). Fractions containing the eluted His l0 -tagged PRO are 
pooled and dialyzed against loading buffer. 
35 Alternatively, purification of the IgG tagged (or Fc tagged) PRO can be performed using known 

chromatography techniques, including for instance, Protein A or protein G column chromatography. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 
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EXAMPLE 144 ; Preparation of Antibodies that Bind PRO 

This example illustrates preparation of monoclonal antibodies which can specifically bind PRO, 

Techniques for producing the monoclonal antibodies are known in the art and are described, for 
instance, in Goding, supra . Immunogens that may be employed include purified PRO, fusion proteins containing 
PRO, and cells expressing recombinant PRO on the cell surface. Selection of the immunogen can be made by 
5 the skilled artisan without undue experimentation. 

Mice, such as Balb/c, are immunized with the PRO immunogen emulsified in complete Freund's 
adjuvant and injected subcutaneously or intraperitoneally in an amount from 1-100 micrograms. Alternatively, 
the immunogen is emulsified in MPL-TDM adjuvant (Ribi Immunochemical Research, Hamilton, MT) and 
injected into the animal's hind foot pads. The immunized mice are then boosted 10 to 12 days later with 
10 additional immunogen emulsified in the selected adjuvant. Thereafter, for several weeks, the mice may also be 
boosted with additional immunization injections. Serum samples may be periodically obtained from the mice 
by retro-orbital bleeding for testing in ELISA assays to detect anti-PRO antibodies. 

After a suitable antibody titer has been detected, the animals "positive" for antibodies can be injected 
with a final intravenous injection of PRO. Three to four days later, the mice are sacrificed and the spleen cells 
15 are harvested. The spleen cells are then fused (using 35% polyethylene glycol) to a selected murine myeloma 
cell line such as P3X63AgU.l, available from ATCC, No. CRL 1597. The fusions generate hybridoma cells 
which can then be plated in 96 well tissue culture plates containing HAT (hypoxanthine, aminopterin, and 
thymidine) medium to inhibit proliferation of non- fused cells, myeloma hybrids, and spleen cell hybrids. 

The hybridoma cells will be screened in an ELISA for reactivity against PRO, Determination of 
20 "positive" hybridoma cells secreting the desired monoclonal antibodies against PRO is within the skill in the art. 

The positive hybridoma cells can be injected intraperitoneally into syngeneic Balb/c mice to produce 
ascites containing the anti-PRO monoclonal antibodies. Alternatively, the hybridoma cells can be grown in tissue 
culture flasks or roller bottles. Purification of the monoclonal antibodies produced in the ascites can be 
accomplished using ammonium sulfate precipitation, followed by gel exclusion chromatography. Alternatively, 
25 affinity chromatography based upon binding of antibody to protein A or protein G can be employed. 

EXAMPLE 145 : Purification of PRO Polypeptides Using Specific Antibodies 

Native or recombinant PRO polypeptides may be purified by a variety of standard techniques in the art 

of protein purification. For example, pro-PRO polypeptide, mature PRO polypeptide, or pre-PRO polypeptide 
30 is purified by immunoaffmity chromatography using antibodies specific for the PRO polypeptide of interest. In 

general, an immunoaffimty column is constructed by covalently coupling the anti-PRO polypeptide antibody to 

an activated chromatographic resin. 

Polyclonal immunoglobulins are prepared from immune sera either by precipitation with ammonium 

sulfate or by purification on immobilized Protein A (Pharmacia LKB Biotechnology, Piscataway, N.J.)- 
35 Likewise, monoclonal antibodies are prepared from mouse ascites fluid by ammonium sulfate precipitation or 

chromatography on immobilized Protein A. Partially purified immunoglobulin is covalently attached to a 

chromatographic resin such as CnBr-activated SEPHAROSE™ (Pharmacia LKB Biotechnology). The antibody 
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is coupled to the resin, the resin is blocked, and the derivative resin is washed according to the manufacturer's 
instructions. 

Such an immunoaffinity column is utilized in the purification of PRO polypeptide by preparing a fraction 
from cells containing PRO polypeptide in a soluble form. This preparation is derived by solubilization of the 
whole cell or of a subcellular fraction obtained via differential centrifugation by the addition of detergent or by 
5 other methods well known in the art. Alternatively, soluble PRO polypeptide containing a signal sequence may 
be secreted in useful quantity into the medium in which the cells are grown. 

A soluble PRO polypeptide -containing preparation is passed over the immunoaffinity column, and the 
column is washed under conditions that allow the preferential absorbance of PRO polypeptide (e.g., high ionic 
strength buffers in the presence of detergent). Then, the column is eluted under conditions that disrupt 
10 antibody /PRO polypeptide binding (e.g. , a low pH buffer such as approximately pH 2-3, or a high concentration 
of a chaotrope such as urea or thiocyanate ion), and PRO polypeptide is collected. 

EXAMPLE 146 : Drug Screening 

This invention is particularly useful for screening compounds by using PRO polypeptides or binding 

15 fragment thereof in any of a variety of drug screening techniques. The PRO polypeptide or fragment employed 
in such a test may either be free in solution, affixed to a solid support, borne on a cell surface, or located 
intracellular ly. One method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably 
transformed with recombinant nucleic acids expressing the PRO polypeptide or fragment. Drugs are screened 
against such transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can be 

20 used for standard binding assays. One may measure, for example, the formation of complexes between PRO 
polypeptide or a fragment and the agent being tested. Alternatively, one can examine the diminution in complex 
formation between the PRO polypeptide and its target cell or target receptors caused by the agent being tested. 

Thus, the present invention provides methods of screening for drugs or any other agents which can 
affect a PRO polypeptide-associated disease or disorder. These methods comprise contacting such an agent with 

25 an PRO polypeptide or fragment thereof and assaying (I) for the presence of a complex between the agent and 
the PRO polypeptide or fragment, or (ii) for the presence of a complex between the PRO polypeptide or fragment 
and the cell, by methods well known in the art. In such competitive binding assays, the PRO polypeptide or 
fragment is typically labeled. After suitable incubation, free PRO polypeptide or fragment is separated from that 
present in bound form, and the amount of free or uncomplexed label is a measure of the ability of the particular 

30 agent to bind to PRO polypeptide or to interfere with the PRO polypeptide/cell complex. 

Another technique for drug screening provides high throughput screening for compounds having suitable 
binding affinity to a polypeptide and is described in detail in WO 84/03564, published on September 13, 1984. 
Briefly stated, large numbers of different small peptide test compounds are synthesized on a solid substrate, such 
as plastic pins or some other surface. As applied to a PRO polypeptide, the peptide test compounds are reacted 

35 with PRO polypeptide and washed. Bound PRO polypeptide is detected by methods well known in the art. 
Purified PRO polypeptide can also be coated directly onto plates for use in the aforementioned drug screening 
techniques. In addition, non-neutralizing antibodies can be used to capture the peptide and immobilize it on the 
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solid support. 

This invention also contemplates the use of competitive drug screening assays in which neutralizing 
antibodies capable of binding PRO polypeptide specifically compete with a test compound for binding to PRO 
polypeptide or fragments thereof. In this manner, the antibodies can be used to detect the presence of any 
peptide which shares one or more antigenic determinants with PRO polypeptide. 

* 5 

EXAMPLE 147 : Rational Drug Design 
S The goal of rational drug design is to produce structural analogs of biologically active polypeptide of 

interest (i.e. , a PRO polypeptide) or of small molecules with which they interact, e.g. , agonists, antagonists, or 
inhibitors. Any of these examples can be used to fashion drugs which are more active or stable forms of the 
1 0 PRO polypeptide or which enhance or interfere with the function of the PRO polypeptide in vivo (c.f. , Hodgson, 
Bio/Technology. 9: 19-21 (1991)). 

In one approach, the three-dimensional structure of the PRO polypeptide, or of an PRO 
polypeptide -inhibitor complex, is determined by x-ray crystallography, by computer modeling or, most typically, 
by a combination of the two approaches. Both the shape and charges of the PRO polypeptide must be ascertained 
15 to elucidate the structure and to determine active site(s) of the molecule. Less often, useful information regarding 
^ the structure of the PRO polypeptide may be gained by modeling based on the structure of homologous proteins, 

y; In both cases, relevant structural information is used to design analogous PRO polypeptide-like molecules or to 

identify efficient inhibitors. Useful examples of rational drug design may include molecules which have improved 
activity or stability as shown by Braxton and Wells, Biochemistry, 31:7796-7801 (1992) or which act as 
20 inhibitors, agonists, or antagonists of native peptides as shown by Athauda et aL, J. Biochem, . 113:742-746 
(1993), 

It is also possible to isolate a target-specific antibody, selected by functional assay, as described above, 
and then to solve its crystal structure. This approach, in principle, yields a pharmacore upon which subsequent 
drug design can be based. It is possible to bypass protein crystallography altogether by generating anti -idiotypic 
25 antibodies (anti-ids) to a functional, pharmacologically active antibody. As a mirror image of a mirror image, 
the binding site of the anti-ids would be expected to be an analog of the original receptor. The anti-id could then 
be used to identify and isolate peptides from banks of chemically or biologically produced peptides. The isolated 
peptides would then act as the pharmacore. 

By virtue or tne present invention, sufficient amounts of the PRO polypeptide may be made available 
30 to perform such analytical studies as X-ray crystallography. In addition, knowledge of the PRO polypeptide 
| amino acid sequence provided herein will provide guidance to those employing computer modeling techniques 

in place of or in addition to x-ray crystallography. 
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Deposit of Material 

The following materials have been deposited with the American Type Culture Collection, 10801 
University Blvd., Manassas, VA 20110-2209, USA (ATCC): 



Table 2 



10 



•is 



25 



30 



35 



40 



45 



50 



Material 


ATCC Dep. No. 


Deposit D 


DNA16422-1209 


209929 


June 2, 1998 


DNA16435-1208 


209930 


June 2, 1998 


DNA21624-1391 


209917 


June 2, 1998 


DNA23334-1392 


209918 


June 2, 1998 


DNA26288-1239 


209792 


April 21, 1998 


DNA26843-1389 


203099 


August 4, 1998 


DNA26844-1394 


209926 


June 2, 1998 


DNA30862-1396 


209920 


June 2, 1998 


DNA35680-1212 


209790 


April 21, 1998 


DNA40621-1440 


209922 


June 2, 1998 


DNA44161-1434 


209907 


May 27, 1998 


DNA44694-1500 


203114 


August 11, 1998 


DNA45495-1550 


203156 


August 25, 1998 


DNA47361-H54 


209431 


November 7, 1997 


DNA47394-1572 


203109 


August 11, 1998 


DNA48320-1433 


209904 


May 27, 1998 


DNA48334-1435 


209924 


June 2, 1998 


DNA48606-1479 


203040 


July I, 1998 


DNA49141-1431 


203003 


June 23, 1998 


DNA491 42-1430 


203002 


June 23, 1998 


DNA49 143- 1429 


203013 


June 23, 1998 


DNA49647-1398 


209919 


June 2, 1998 


DNA49819-1439 


209931 


June 2, 1998 


DNA49820-1427 


209932 


June 2, 1998 


DNA4982 1-1562 


209981 


June 16, 1998 


DN A52 1 92- 1 369 


203042 


July 1, 1998 


DNA52598-1518 


203107 


August 1 1, 1998 


DNA53913-1490 


203162 


August 25, 1998 


DNA53978-1443 


209983 


June 16, 1998 


Urn ADJyyo-1442 


zuyyz 1 


June z, lyyo 


DNA56041-1416 


203012 


June 23, 1998 


DNA56047-1456 


209948 


June 9, 1998 


DNA56050-1455 


203011 


June 23, 1998 


DNA56 110-1437 


203113 


August 11, 1998 


DNA561 13-1378 


203049 


July 1, 1998 


DNA56410-1414 


209923 


June 2, 1998 


DNA56436-1448 


209902 


May 27, 1998 


DNA56855-1447 


203004 


June 23, 1998 


DNA56859-1445 


203019 


June 23, 1998 


DNA56860-1510 


209952 


June 9, 1998 


DNA56865-1491 


203022 


June 23, 1998 


DNA56866-1342 


203023 


June 23, 1998 


DNA56868-1209 


203024 


June 23, 1998 


DNA56869-1545 


203161 


August 25, 1998 


DNA56870-1492 


209925 


June 2, 1998 


DNA57033-1403 


209905 


May 27, 1998 


DNA57037-1444 


209903 


May 27, 1998 


DNA57129-1413 


209977 


June 16, 1998 
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55 



DN A57690- 1 374 


209950 


June 9, 1998 


DNA57693-1424 


203008 


June 23, 1998 


rv\J * CI £1C\A 11/11 

DNA57694-1341 


203017 


June 23, 1998 


DNA57695-1340 


203006 


June 23, 1998 


DNA57699- 14 1 2 


203020 


June 23, 1998 


UNAD/ /Uz-14/0 


20995 1 


June 9, 1998 


DN A57704- 1452 


209953 


June 9, 1998 


UN AD / /Uo- 1411 


203021 


June 23, 1998 


UNA!)// 1U-14D 1 


203048 


July 1, 1998 


r\\1 A CT7 11 1 £7/\ 1 

DNA5771 1-1501 


203047 


July 1, 1998 


DNA57827- 1493 


203045 


July 1, 1998 


DNA57834-1339 


209954 


June 9, 1998 


DNA57836-1338 


203025 


June 23, 1998 


DNA57838-1337 


203014 


June 23, 1998 


DNA57844-1410 


203010 


June 23, 1998 


DNA5872 1-1475 


203110 


August 11, 1998 


DNA58723-1588 


203133 


August 18, 1998 


DNA58737-1473 


203136 


August 18, 1998 


DNA58743-1609 


203154 


August 25, 1998 


DNA58846-1409 


209957 


June 9, 1998 


DNA58848-1472 


209955 


June 9, 1998 


r\ VTA r A A 4 /"\ 4 J a J 

DNA58849-1494 


209958 


June 9, 1998 


DNA58850-1495 


209956 


June 9, 1998 


DNA58853-1423 


203016 


June 23, 1998 


DNA58855-1422 


203018 


June 23, 1998 


DNA59205-1421 


203009 


June 23, 1998 


DNA592 11-1450 


209960 


June 9, 1998 


DNA59213-1487 


209959 


June 9, 1998 


DNA59214-1449 


203046 


July 1, 1998 


DNA59215-1425 


209961 


June 9, 1998 


DNA59220-1514 


209962 


June 9, 1998 


DNA59488-1603 


203157 


August 25, 1998 


DNA59493-1420 


203050 


July 1, 1998 


DNA59497-1496 


209941 


June 4, 1998 


DNA59588-1571 


203106 


August 11, 1998 


DNA59603-1419 


209944 


June 9, 1998 


DNA59605-1418 


203005 


June 23, 1998 


UN AjvoOo- 14/1 


209945 


June 9, 1998 


UN AZ>yoU7- 1497 


209957 


June 9, 1998 


UN ADyOUy- 14 /U 


209963 


June 9, 1998 


t"\kta^o^ia 1 <<:fi 
UNAjVdIU-OjV 


209990 


June 16, 1998 


JJIN A3 yo i Z- 1 400 


209947 


June 9, 1998 


l^\KT A <;A/C 11 1 ,4 1 *7 


203007 


June 23, 1998 


UN AjVo lo- 1465 


209991 


June 16, 1998 


n\T A Cfi/C 1 n 1 AHA 

un Ar>yoiy-i4o4 


203041 


July 1, 1998 


T^KT A Cft£1A 1 /1£1 

UN A3yozt> 140.5 


209989 


June 16, 1998 




zuyyyz 


June 17, 1998 


DNA59767-1489 


203108 


August 11, 1998 


DNA59776-1600 


203128 


August 18, 1998 


DNA59777-1480 


203111 


August 11, 1998 


DNA59820-1549 


203129 


August 18, 1998 


DNA59827-1426 


203089 


August 4, 1998 


DNA59828-1608 


203158 


August 25, 1998 


DNA59838-1462 


209976 


June 16, 1998 


DNA59839-1461 


209988 


June 16, 1998 


DNA5984 1-1460 


203044 


July 1, 1998 


DNA59842-1502 


209982 


June 16, 1998 
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DNA59846-1503 


209978 


June 16, 1998 




DNA59847-1511 


203098 


August 4, 1998 




DNA59848-1512 


203088 


August 4, 1998 




DNA59849-1504 


209986 


June 16, 1998 




DNA59853-1505 


209985 


June 16, 1998 


5 


DNA59854-1459 


209974 


June 16, 1998 




DNA60283-1484 


203043 


July 1, 1998 




DNA60615-1483 


209980 


June 16, 1998 




DNA60619-1482 


209993 


June 16, 1998 




DNA60621-1516 


203091 


August 4, 1998 


10 


DNA60622-1525 


203090 


August 4, 1998 




DNA60625-1507 


209975 


June 16, 1998 




DNA6O627-1508 


203092 


August 4, 1998 




DNA60629-1481 


209979 


June 16, 1998 




DNA61755-1554 


203112 


August 11, 1998 


15 


DNA6 1873- 1574 


203132 


August 18, 1998 




DNA62814-1521 


203093 


August 4, 1998 




DNA62872-1509 


203100 


August 4, 1998 




DNA62876-1517 


203095 


August 4, 1998 




DNA62881-1515 


203096 


August 4, 1998 


20 


DNA64852-1589 


203127 


August 18, 1998 




DNA64884-1527 


203155 


August 25, 1998 




DNA64890-1612 


203131 


August 18, 1998 




DNA65412-1523 


203094 


August 4, 1998 


25 


DNA66308-1537 


203159 


August 25, 1998 


DNA66309-1538 


203235 


September 15, 1998 




DNA67004-16I4 


203115 


August 11, 1998 




DNA68869-1610 


203164 


August 25, 1998 




DNA68872-1620 


203160 


August 25, 1998 




DNA7 1159-1617 


203135 


August 18, 1998 


30 







These deposit were made under the provisions of the Budapest Treaty on the International Recognition 
of the Deposit of Microorganisms for the Purpose of Patent Procedure and the Regulations thereunder (Budapest 
Treaty). This assures maintenance of a viable culture of the deposit for 30 years from the date of deposit. The 
deposits will be made available by ATCC under the terms of the Budapest Treaty, and subject to an agreement 

35 between Genentech, Inc. and ATCC, which assures permanent and unrestricted availability of the progeny of 
the culture of the deposit to the public upon issuance of the pertinent U.S. patent or upon laying open to the 
public of any U.S. or foreign patent application, whichever comes first, and assures availability of the progeny 
to one determined by the U.S. Commissioner of Patents and Trademarks to be entitled thereto according to 35 
USC §122 and the Commissioner's rules pursuant thereto (including 37 CFR §1. 14 with particular reference to 

40 886 OG 638). 

The assignee of the present application has agreed that if a culture of the materials on deposit should 
die or be lost or destroyed when cultivated under suitable conditions, the materials will be promptly replaced on 
notification with another of the same. Availability of the deposited material is not to be construed as a license 
to practice the invention in contravention of the rights granted under the authority of any government in 
45 accordance with its patent laws. 

The foregoing written specification is considered to be sufficient to enable one skilled in the art to 
practice the invention. The present invention is not to be limited in scope by the construct deposited, since the 
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deposited embodiment is intended as a single illustration of certain aspects of the invention and any constructs 
that are functionally equivalent are within the scope of this invention. The deposit of material herein does not 
constitute an admission that the written description herein contained is inadequate to enable the practice of any 
aspect of the invention, including the best mode thereof, nor is it to be construed as limiting the scope of the 
claims to the specific illustrations that it represents. Indeed, various modifications of the invention in addition 
to those shown and described herein will become apparent to those skilled in the art from the foregoing 
description and fall within the scope of the appended claims. 
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WHAT IS CLAIMED IS : 

1. Isolated nucleic acid having at least 80% sequence identity to a nucleotide sequence that 

encodes a polypeptide comprising an amino acid sequence selected from the group consisting of the amino acid 
sequence shown in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ ID NO:6), Figure 6 (SEQ ID NO:8), Figure 9 
(SEQ ID NO: 14), Figure 12 (SEQ ID NO:20), Figure 15 (SEQ ID NO:23), Figure 18 (SEQ ID NO:28), Figure 
5 20 (SEQ ID NO:30), Figure 23 (SEQ ID NO:33), Figure 25 (SEQ ID NO:36), Figure 27 (SEQ ID NO:41), 
Figure 30 (SEQ ID NO:47), Figure 32 (SEQ ID NO:52), Figure 34 (SEQ ID NO:57), Figure 36 (SEQ ID 
NO:62), Figure 38 (SEQ ID NO;67), Figure 41 (SEQ ID NO:73), Figure 47 (SEQ ID NO:84), Figure 49 (SEQ 
ID NO:95), Figure 51 (SEQ ID NO:97), Figure 53 (SEQ ID NO:99), Figure 57 (SEQ ID NO: 103), Figure 64 
(SEQ ID NO: 1 13), Figure 66 (SEQ ID NO: 1 15), Figure 68 (SEQ ID NO: 1 17), Figure 70 (SEQ ID NO: 1 19), 

10 Figure 72 (SEQ ID NO: 124), Figure 74 (SEQ ID NO: 129), Figure 76 (SEQ ID NO: 135), Figure 79 (SEQ ID 
NO: 138), Figure 83 (SEQ ID NO: 146), Figure 85 (SEQ ID NO: 148), Figure 88 (SEQ ID NO: 151), Figure 90 
(SEQ ID NO: 153), Figure 93 (SEQ ID NO: 156), Figure 95 (SEQ ID NO: 158), Figure 97 (SEQ ID NO: 160), 
Figure 99 (SEQ ID NO: 165), Figure 101 (SEQ ID NO: 167), Figure 103 (SEQ ID NO: 169), Figure 105 (SEQ 
ID NO: 171), Figure 109 (SEQ ID NO: 175), Figure 111 (SEQ ID NO: 177), Figure 113 (SEQ ID NO: 179), 

15 Figure 1 15 (SEQ ID NO: 181), Figure 1 17 (SEQ ID NO: 183), Figure 120 (SEQ ID NO: 189), Figure 122 (SEQ 
ID NO: 194), Figure 125 (SEQ ID NO: 197), Figure 127 (SEQ ID NO: 199), Figure 129 (SEQ ID NO:201), 
Figure 131 (SEQ ID NO:203), Figure 133 (SEQ ID NO:205), Figure 135 (SEQ ID NO:207), Figure 137 (SEQ 
ID NO:209), Figure 139 (SEQ ID NO:211), Figure 141 (SEQ ID NO:213), Figure 144 (SEQ ID NO:216), 
Figure 147 (SEQ ID NO:219), Figure 149 (SEQ ID NO:221), Figure 151 (SEQ ID NO:223), Figure 153 (SEQ 

20 ID NO:225), Figure 155 (SEQ ID NO:227), Figure 157 (SEQ ID NO:229), Figure 159 (SEQ ID NO:231), 
Figure 161 (SEQ ID NO:236), Figure 163 (SEQ ID NO:241), Figure 165 (SEQ ID NO:246), Figure 167 (SEQ 
ID NO:248), Figure 169 (SEQ ID NO:250), Figure 171 (SEQ ID NO:253), Figure 174 (SEQ ID NO:256), 
Figure 176 (SEQ ID NO:258), Figure 178 (SEQ ID NO:260), Figure 180 (SEQ ID NO:262), Figure 182 (SEQ 
ID NO:264), Figure 184 (SEQ ID NO:266), Figure 186 (SEQ ID NO:268), Figure 188 (SEQ ID NO:270), 

25 Figure 190 (SEQ ID NO:272), Figure 192 (SEQ ID NO:274), Figure 194 (SEQ ID NO:276), Figure 196 (SEQ 
ID NO:278), Figure 198 (SEQ ID NO:281), Figure 200 (SEQ ID NO:283), Figure 202 (SEQ ID NO:285), 
Figure 204 (SEQ ID NO:287), Figure 206 (SEQ ID NO:289), Figure 208 (SEQ ID NO:291), Figure 210 (SEQ 
ID NO:293), Figure 212 (SEQ ID NO:295), Figure 214 (SEQ ID NO:297), Figure 216 (SEQ ID NO:299), 
Figure 218 (SEQ ID NO:30I), Figure 220 (SEQ lu NO;303) t Figure 226 (SEQ ID NO:309), Figure 228 (SEQ 

30 ID NO:314), Figure 230 (SEQ ID NO:319), Figure 233 (SEQ ID NO:326), Figure 235 (SEQ ID NO:334), 
Figure 238 (SEQ ID NO:340), Figure 240 (SEQ ID NO:345), Figure 242 (SEQ ID NO:347), Figure 244 (SEQ 
ID NO:349), Figure 246 (SEQ ID NO:351), Figure 248 (SEQ ID NO:353), Figure 250 (SEQ ID NO:355), 
Figure 252 (SEQ ID NO:357), Figure 254 (SEQ ID NO:359), Figure 256 (SEQ ID NO:361), Figure 258 (SEQ 
ID NO:363), Figure 260 (SEQ ID NO:365), Figure 262 (SEQ ID NO:367), Figure 264 (SEQ ID NO:369), 

35 Figure 266 (SEQ ID NO:371), Figure 268 (SEQ ID NO:373), Figure 270 (SEQ ID NO:375), Figure 272 (SEQ 
ID NO:377), Figure 274 (SEQ ID NO:379), Figure 276 (SEQ ID NO:381) 7 Figure 278 (SEQ ID NO:387), 
Figure 280 (SEQ ID NO:389), Figure 282 (SEQ ID NO;394), Figure 284 (SEQ ID NO:399), Figure 286 (SEQ 
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ID NO:401), Figure 288 <SEQ ID NO:403), Figure 290 (SEQ ID NO:408), Figure 292 (SEQ ID NO:410), 
Figure 294 (SEQ ID NO:412), Figure 296 (SEQ ID NO:414), Figure 298 (SEQ ID NO:416), Figure 300 (SEQ 
ID NO:418), Figure 302 (SEQ ID NO:420), Figure 304 (SEQ ID NO:422) and Figure 306 (SEQ ID NO:424). 

2. The nucleic acid sequence of Claim 1 , wherein said nucleotide sequence comprises a nucleotide 
: % 5 sequence selected from the group consisting of the sequence shown in Figure 1 (SEQ ID NO: 1), Figure 3 (SEQ 

ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO: 13), Figure 1 1 (SEQ ID NO: 19), Figure 14 (SEQ 
■g ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ ID NO:32), Figure 24 

If (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31 (SEQ ID NO:51), Figure 

33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 40 (SEQ ID NO-.72), 
10 Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), Figure 52 (SEQ ID 
NO:98), Figure 56 (SEQ ID NO: 102), Figure 63 (SEQ ID NO: 1 12), Figure 65 (SEQ ID NO: 1 14), Figure 67 
(SEQ ID NO:l 16), Figure 69 (SEQ ID NO: 118), Figure 71 (SEQ ID NO: 123), Figure 73 (SEQ ID NO: 128), 
Figure 75 (SEQ ID NO: 134), Figure 78 (SEQ ID NO: 137), Figure 82 (SEQ ID NO: 145), Figure 84 (SEQ ID 
NO: 147), Figure 87 (SEQ ID NO: 150), Figure 89 (SEQ ID NO: 152), Figure 92 (SEQ ID NO: 155), Figure 94 
15 (SEQ ID NO: 157), Figure 96 (SEQ ID NO: 159), Figure 98 (SEQ ID NO: 164), Figure 100 (SEQ ID NO: 166), 
Figure 102 (SEQ ID NO:168), Figure 104 (SEQ ID NO: 170), Figure 108 (SEQ ID NO: 174), Figure 1 10 (SEQ 
Si ID NO: 176), Figure 112 (SEQ ID NO: 178), Figure 114 (SEQ ID NO: 180), Figure 116 (SEQ ID NO: 182), 

' Figure 119 (SEQ ID NO: 188), Figure 121 (SEQ ID NO: 193), Figure 124 (SEQ ID NO: 196), Figure 126 (SEQ 

ID NO: 198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure 132 (SEQ ID NO:204), 
20 Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID NO:210), Figure 140 (SEQ 
ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 148 (SEQ ID NO:220), 
Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID NO:226), Figure 156 (SEQ 
ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 162 (SEQ ID NO:240), 
Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID NO:249), Figure 170 (SEQ 
25 ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257), Figure 177 (SEQ ID NO:259), 
Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID NO:265), Figure 185 (SEQ 
ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271), Figure 191 (SEQ ID NO:273), 
Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID NO:280), Figure 199 (SEQ 
# ID NO:282), Figure 20 1 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 205 (SFQ ID NO:288), 

30 Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 2 1 1 (SEQ ID NO:294), Figure 213 (SEQ 
; ; ID NO;296), Figure 215 (SEQ ID NO:298) T Figure 217 (SEQ ID NO:300), Figure 219 (SEQ ID NO:302), 

Figure 225 (SEQ ID NO:308) 4 Figure 227 (SEQ ID NO:3 13), Figure 229 (SEQ ID NO:318), Figure 232 (SEQ 
ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure 239 (SEQ ID NO:344), 
Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID NO:350), Figure 247 (SEQ 
35 ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 253 (SEQ ID NO:358), 
Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID NO:364), Figure 261 (SEQ 
" r - ID NO:366), Figure 263 (SEQ ID NO:368) t Figure 265 (SEQ ID NO:370), Figure 267 (SEQ ID NO:372), 
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Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID NO:378), Figure 275 (SEQ 
ID NO:380), Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 281 (SEQ ID NO:393), 
Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID NO:402), Figure 289 (SEQ 
ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID NO:411), Figure 295 (SEQ ID NO;413), 
Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID NO:419), Figure 303 (SEQ 
* 5 ID NO;421) and Figure 305 (SEQ ID NO:423). 

'f 3 . The nucleic acid of Claim i , wherein said nucleotide sequence comprises a nucleotide sequence 

iff 

- ■ selected from the group consisting of the full-length coding sequence of the sequence shown in Figure 1 (SEQ 

ID NO:l), Figure 3 (SEQ IDNO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO: 13), Figure II (SEQ ID 

10 NO: 19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ 
ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31 
1 (SEQ ID NO:51), Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 

40 (SEQ ID NO:72), Figure 46 (SEQ ID NO;83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), 
Figure 52 (SEQ ID NO:98), Figure 56 (SEQ ID NO; 102), Figure 63 (SEQ ID NO: 112), Figure 65 (SEQ ID 

15 NO: 1 14), Figure 67 (SEQ ID NO: 1 16), Figure 69 (SEQ ID NO: 118), Figure 71 (SEQ ID NO: 123), Figure 73 
v (SEQ ID NO: 128), Figure 75 (SEQ ID NO: 134), Figure 78 (SEQ ID NO: 137), Figure 82 (SEQ ID NO: 145), 

% Figure 84 (SEQ ID NO: 147), Figure 87 (SEQ ID NO: 150), Figure 89 (SEQ ID NO: 152), Figure 92 (SEQ ID 

4i NO: 155), Figure 94 (SEQ ID NO: 157), Figure 96 (SEQ ID NO: 159), Figure 98 (SEQ ID NO: 164), Figure 100 

(SEQ ID NO: 166), Figure 102 (SEQ ID NO: 168), Figure 104 (SEQ ID NO: 170), Figure 108 (SEQ ID 

20 NO: 174), Figure 1 10 (SEQ ID NO: 176), Figure 1 12 (SEQ ID NO: 178), Figure 1 14 (SEQ ID NO: 180), Figure 
116 (SEQ ID NO: 182), Figure 119 (SEQ ID NO: 188), Figure 121 (SEQ ID NO: 193), Figure 124 (SEQ ID 
NO: 196), Figure 126 (SEQ ID NO: 198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure 
132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID 
NO:210), Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 

25 148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID 
NO:226), Figure 156 (SEQ ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 
162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID 
NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257), Figure 
177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID 

30 NO:265), Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271), Figure 
191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID 
NO:280), Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 
205 (SEQ ID NO:288), Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID 
NO:294), Figure 213 (SEQ ID NO:296), Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 

35 219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID 
NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure 
^ 239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID 
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NO:350), Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 
253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO;362), Figure 259 (SEQ ID 
NO:364), Figure 261 (SEQ ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 
267 (SEQ ID NO:372), Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID 

m NO:378), Figure 275 (SEQ ID NO:380), Figu™* ^77 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 

5 281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID N0:400), Figure 287 (SEQ ID 
NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID NO:41 1), Figure 

|j 295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 2^9 (SEQ ID NO:4I7), Figure 301 (SEQ ID 

i NO:419), Figure 303 (SEQ ID NO:421) or Figure 305 (SEQ ID NO:423). 

10 4. Isolated nucleic acid which comprises the full-length coding sequence of the DNA deposited 

under any AT CC accession number shown in Table 2. 

5. A vector comprising the nucleic acid of Claim 1. 

15 6. The vector of Claim 5 operably linked to control sequences recognized by a host cell 

transformed with the vector. 

I 

7. A host cell comprising the vector of Claim 5. 
20 8. The host cell of Claim 7 wherein said cell is a CHO cell. 

9. The host cell of Claim 7 wherein said cell is an E. coli. 

10. The host cell of Claim 7 wherein said cell is a yeast cell. 

25 

11. A process for producing a PRO polypeptides comprising culturing the host cell of Claim 7 
under conditions suitable for expression of said PRO polypeptide and recovering said PRO polypeptide from the 
cell culture. 

30 12. Isolated PRO polypeptide having at least 80% sequence identity to an amino acid sequence 

selected from the group consisting of the amino acid sequence shown in Figure 2 (SEQ ID NO: 2), Figure 4 (SEQ 
ID NO:6), Figure 6 (SEQ ID NO:8), Figure 9 (SEQ ID NO:14) 7 Figure 12 (SEQ ID NO:20), Figure 15 (SEQ 
ID NO:23), Figure 18 (SEQ ID NO:28), Figure 20 (SEQ ID NO:30), Figure 23 (SEQ ID NO:33), Figure 25 
(SEQ ID NO:36), Figure 27 (SEQ ID NO:41), Figure 30 (SEQ ID NO:47), Figure 32 (SEQ ID NO;52), Figure 

35 34 (SEQ ID NO:57), Figure 36 (SEQ ID NO:62), Figure 38 (SEQ ID NO:67), Figure 41 (SEQ ID NO:73), 
Figure 47 (SEQ ID NO:84), Figure 49 (SEQ ID NO:95), Figure 51 (SEQ ID NO:97), Figure 53 (SEQ ID 
NO:99), Figure 57 (SEQ ID NO: 103), Figure 64 (SEQ ID NO: 113), Figure 66 (SEQ ID NO: 1 15), Figure 68 
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(SEQ ID NO:l 17), Figure 70 (SEQ ID NO:119), Figure 72 (SEQ ID NO: 124), Figure 74 (SEQ ID NO: 129), 
Figure 76 (SEQ ID NO: 135), Figure 79 (SEQ ID NO: 138), Figure 83 (SEQ ID NO: 146), Figure 85 (SEQ ID 
NO: 148), Figure 88 (SEQ ID NO: 151), Figure 90 (SEQ ID NO: 153), Figure 93 (SEQ ID NO: 156), Figure 95 
(SEQ ID NO: 158), Figure 97 (SEQ ID NO: 160), Figure 99 (SEQ ID NO: 165), Figure 101 (SEQ ID NO: 167), 
Figure 103 (SEQ ID NO: 169), Figure 105 (SEQ ID NO: 171), Figure 109 (SEQ ID NO: 175), Figure 111 (SEQ 
5 ID NO: 177), Figure 113 (SEQ ID NO: 179), Figure 115 (SEQ ID NO: 181), Figure 117 (SEQ ID NO: 183), 
Figure 120 (SEQ ID NO:189), Figure 122 (SEQ ID NO:194), Figure 125 (SEQ ID NO:197), Figure 127 (SEQ 
ID NO:199), Figure 129 (SEQ ID NO:201), Figure 131 (SEQ ID NO:203), Figure 133 (SEQ ID NO:205), 
Figure 135 (SEQ ID NO:207), Figure 137 (SEQ ID NO:209), Figure 139 (SEQ ID NO:21 1), Figure 141 (SEQ 
ID NO:2I3), Figure 144 (SEQ ID NO:216), Figure 147 (SEQ ID NO:219), Figure 149 (SEQ ID NO:221), 

10 Figure * "1 (SEQ ID NO:223), Figure 153 (SEQ ID NO:225), Figure 155 (SEQ ID NO:227), Figure 157 (SEQ 
ID NO:229), Figure 159 (SEQ ID NO:231), Figure 161 (SEQ ID NO:236), Figure 163 (SEQ ID NO:241), 
Figure 165 (SEQ ID NO:246), Figure 167 (SEQ ID NO:248), Figure 169 (SEQ ID NO:250), Figure 171 (SEQ 
ID NO:253), Figure 174 (SEQ ID NO:256), Figure 176 (SEQ ID NO:258), Figure 178 (SEQ ID NO:260), 
Figure 180 (SEQ ID NO:262), Figure 182 (SEQ ID NO:264), Figure 184 (SEQ ID NO:266), Figure 186 (SEQ 

15 ID NO:268), Figure 188 (SEQ ID NO:270), Figure 190 (SEQ ID NO:272), Figure 192 (SEQ ID NO:274), 
Figure 194 (SEQ ID NO:276), Figure 196 (SEQ ID NO:278), Figure 198 (SEQ ID NO:281), Figure 200 (SEQ 
ID NO:283), Figure 202 (SEQ ID NO:285), Figure 204 (SEQ ID NO:287), Figure 206 (SEQ ID NO:289) t 
Figure 208 (SEQ ID NO:291), Figure 210 (SEQ ID NO:293), Figure 212 (SEQ ID NO:295), Figure 214 (SEQ 
ID NO:297), Figure 216 (SEQ ID NO:299), Figure 218 (SEQ ID NO:301), Figure 220 (SEQ ID NO:303), 

20 Figure 226 (SEQ ID NO:309), Figure 228 (SEQ ID NO:314), Figure 230 (SEQ ID NO:319), Figure 233 (SEQ 
ID NO:326), Figure 235 (SEQ ID NO:334), Figure 238 (SEQ ID NO:340), Figure 240 (SEQ ID NO:345), 
Figure 242 (SEQ ID NO:347), Figure 244 (SEQ ID NO:349), Figure 246 (SEQ ID NO:351), Figure 248 (SEQ 
ID NO:353), Figure 250 (SEQ ID NO:355), Figure 252 (SEQ ID NO:357), Figure 254 (SEQ ID NO:359), 
Figure 256 (SEQ ID NO:361), Figure 258 (SEQ ID NO:363), Figure 260 (SEQ ID NO:365), Figure 262 (SEQ 

25 ID NO:367), Figure 264 (SEQ ID NO:369), Figure 266 (SEQ ID NO:371), Figure 268 (SEQ ID NO:373), 
Figure 270 (SEQ ID NO:375), Figure 272 (SEQ ID NO:377), Figure 274 (SEQ ID NO:379), Figure 276 (SEQ 
ID NO:381), Figure 278 (SEQ ID NO:387), Figure 280 (SEQ ID NO:389), Figure 282 (SEQ ID NO:394), 
Figure 284 (SEQ ID NO:399), Figure 286 (SEQ ID NO:401), Figure 288 (SEQ ID NO:403), Figure 290 (SEQ 
ID NO:408), Figure 292 (SEQ ID NO:410), Figure 294 (SEQ ID NO:412), Figure 296 (SEQ ID NO:414), 

30 Figure 298 (SEQ ID NO:416), Figure 300 (SEQ ID NO:418), Figure 302 (SEQ ID NO:420), Figure 304 (SEQ 
ID NO:422) and Figure 306 (SEQ ID NO:424). 

13. Isolated PRO polypeptide having at least 80% sequence identity to the amino acid sequence 
encoded by a nucleic acid molecule deposited under any ATCC accession number shown in Table 2. 



35 



14. A chimeric molecule comprising a polypeptide according to Claim 12 fused to a heterologous 
amino acid sequence. 
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15. The chimeric molecule of Claim 14 wherein said heterologous amino acid sequence is an 
epitope tag sequence . 

16. The chimeric molecule of Claim 14 wherein said heterologous amino acid sequence is a Fc 
region of an immunoglobulin. 



17. An antibody which specifically binds to a PRO polypeptide according to Claim 12. 

If! 18. The antibody of Claim 17 wherein said antibody is a monoclonal antibody. 

10 19. The antibody of Claim 17 wherein said antibody is a humanized antibody. 

20. The antibody of Claim 17 wherein said antibody is an antibody fragment. 

21. An isolated nucleic acid molecule which has at least 80% sequence identity to a nucleic acid 
15 which comprises a nucleotide sequence selected from the group consisting of that shown in Figure 1 (SEQ ID 

NO:l), Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO: 13), Figure 11 (SEQ ID 
NO: 19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ 
ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31 
(SEQ ID NO:51), Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 
20 40 (SEQ ID NO:72), Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), 
Figure 52 (SEQ ID NO:98), Figure 56 (SEQ ID NO: 102), Figure 63 (SEQ ID NO: 112), Figure 65 (SEQ ID 
NO: 114), Figure 67 (SEQ ID NO: 116), Figure 69 (SEQ ID NO: 118), Figure 71 (SEQ ID NO: 123), Figure 73 
(SEQ ID NO: 128), Figure 75 (SEQ ID NO: 134), Figure 78 (SEQ ID NO: 137), Figure 82 (SEQ ID NO: 145), 
Figure 84 (SEQ ID NO: 147), Figure 87 (SEQ ID NO: 150), Figure 89 (SEQ ID NO: 152), Figure 92 (SEQ ID 
25 NO: 155), Figure 94 (SEQ ID NO: 157), Figure 96 (SEQ ID NO: 159), Figure 98 (SEQ ID NO: 164), Figure 100 
(SEQ ID NO: 166), Figure 102 (SEQ ID NO: 168), Figure 104 (SEQ ID NO: 170), Figure 108 (SEQ ID 
NO: 174), Figure 1 10 (SEQ ID NO: 176), Figure 1 12 (SEQ ID NO: 178), Figure 1 14 (SEQ ID NO: 180), Figure 
116 (SEQ ID NO: 182), Figure 119 (SEQ ID NO: 188), Figure 121 (SEQ ID NO: 193), Figure 124 (SEQ ID 
NO: 196), Figure 126 (SEQ ID NO: 198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID inO:202), Figure 
30 132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID 
NO:210), Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 
148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID 
NO:226), Figure 156 (SEQ ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 
162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID 
35 NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257), Figure 
177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID 
NO:265), Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271), Figure 
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191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID 
NO:280), Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 
205 (SEQ ID NO:288), Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID 
NO:294), Figure 213 (SEQ ID NO:296), Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 
219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID 

| 5 NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure 

239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID 

| NO:350), Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 

^ 253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO;362), Figure 259 (SEQ ID 

NO:364), Figure 261 (SEQ ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO;370), Figure 

?; . 10 267 (SEQ ID NO:372), Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID 

NO:378), Figure 275 (SEQ ID NO:380), Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 

' ■ 281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID 

NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID NO:41 1), Figure 
295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID 
15 NO:419), Figure 303 (SEQ ID NO:421) and Figure 305 (SEQ ID NO:423). 

s 

^ 22. An isolated nucleic acid molecule which has at least 80% sequence identity to the full-length 

coding sequence of a nucleotide sequence selected from the group consisting of that shown in Figure 1 (SEQ ID 
NO:l), Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO: 13), Figure 11 (SEQ ID 

20 NO: 19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ 
ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31 
(SEQ ID NO:51), Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 
40 (SEQ ID NO:72), Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), 
Figure 52 (SEQ ID NO:98), Figure 56 (SEQ ID NO: 102), Figure 63 (SEQ ID NO: 112), Figure 65 (SEQ ID 

25 NO:114), Figure 67 (SEQ ID NO: 116), Figure 69 (SEQ ID NO: 1 18), Figure71 (SEQ ID NO; 123), Figure 73 
(SEQ ID NO: 128), Figure 75 (SEQ ID NO: 134), Figure 78 (SEQ ID NO: 137), Figure 82 (SEQ ID NO: 145), 
Figure 84 (SEQ ID NO: 147), Figure 87 (SEQ ID NO: 150), Figure 89 (SEQ ID NO: 152), Figure 92 (SEQ ID 
NO: 155), Figure 94 (SEQ ID NO: 157), Figure 96 (SEQ ID NO: 159), Figure 98 (SEQ ID NO: 164), Figure 100 
^ (SEQ ID NO: 166), Figure 102 (SEQ ID NO: 168), Figure 104 (SEQ ID NO: 170), Figure 108 (SEQ ID 

30 NO: 174), Figure 1 10 (SEQ ID NO: 176), Figure 1 12 (SEQ ID NO: 178), Figure 1 14 (SEQ ID NO: 180), Figure 
•k 116 (SEQ ID NO: 182), Figure 119 (SEQ ID NO: 188), Figure 121 (SEQ ID NO: 193), Figure 124 (SEQ ID 

NO: 196), Figure 126 (SEQ ID NO: 198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure 
132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID 
NO:210), Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 

35 148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID 
NO:226), Figure 156 (SEQ ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 
162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID 
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NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257), Figure 
177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID 
NO:265), Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271), Figure 
191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID 
NO:280), Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 
5 205 (SEQ ID NO:288), Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID 
NO:294), Figure 213 (SEQ ID NO:296), Figure 215 (SEQ ID NO:298) t Figure 217 (SEQ ID NO:300), Figure 
219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID 
NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure 
239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID 

10 NO:350), Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 
253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360) > Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID 
NO:364) T Figure 261 (SEQ ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 
267 (SEQ ID NO:372), Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID 
NO:378), Figure 275 (SEQ ID NO:380), Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 

15 281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID 
NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID NO:41 1), Figure 
295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:4I7), Figure 301 (SEQ ID 
NO:419), Figure 303 (SEQ ID NO:421) and Figure 305 (SEQ ID NO:423). 

20 23. An isolated extracellular domain of of PRO polypeptide. 

24. An isolated PRO polypeptide lacking its associated signal peptide. 

25. An isolated polypeptide having at least about 80% amino acid sequence identity to an 
25 extracellular domain of of PRO polypeptide. 

26. An isolated polypeptide having at least about 80% amino acid sequence identity to a PRO 
polypeptide lacking its associated signal peptide. 
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FIGURE 1 

CGGACGCGTGGGTGCGAGGCGAAGGTGACCGGGGACCGAGCATTTCAGATCTGCTCGGTAGA 

CCTGGTGCACCACCACCATGTTGGCTGCAAGGCTGGTGTGTCTCCGGACACTACCTTCTAGG 

GTTTTCCACCCAGCTTTCACCAAGGCCTCCCCTGTTGTGAAGAATTCCATCACGAAGAATCA 

ATGGCTGTTAACACCTAGCAGGGAATATGCCACCAAAACAAGAATTGGGATCCGGCGTGGGA 

GAACTGGCCAAGAACTCAAAGAGGCAGCATTGGAACCATCGATGGAAAAAATATTTAAAATT 

GATCAGATGGGAAGATGGTTTGTTGCTGGAGGGGCTGTGTTGGTCTTGGAGCATTGTGCTA 

CT ATGG CT TG GG ACTGTCTAAT GAG ATT GG AG CT AT TGAAAAGGCTGTAATTTGGCCTCAGT 

ATGTCAAGGATAGAATTCATTCCACCT^PATGTACTTAGCAGGGAGTATTGGTTTAACAGCT 

TTGTCTGCCATAGCAATCAGCAGAACGCCTGTTCTCATGAACTTCATGATGAGAGGCTCTTG 

GGTGACAATTGGTGTGAC CTTTGCAGCCATGGTTGGAGCTGGAATG CTGGTA CG AT CAAT AC 

CATATGACCAGAGCCCAGGCCCAAAGCATCTTGCTTGGTTGCTACATTCTGGTGTGATGGGT 

GCAGTGGTGGCTCCTCTGACAATATTAGGGGGTCCTCTTCTCATCAGAGCTGCATGGTACAC 

AGCTGGCATTGTGGGAGGCCTCTCCACTGTGGCCATGTGTGCGCCCAGTGAAAAGTTTCTGA 

ACATGGGTGCACCCCTGGGAGTGGGCCTGGGTCTCGTCTTTGTGTCCTCATTGGGATCTATG 

TTTCTTCCACCTACCACCGTGGCTGGTGCCACTCTTTACTCAGTGGCAATGTACGGTGGATT 

AGTTCTTTTCAGCATGTTCCTTCTGTATGATACCCAGAAAGTAATCAAGCGTGCAGAAGTAT 

CACCAATGTATGGAGTTCAAAAATATGATC CC AT TAACTCGATGCTGAGTATCTACATGGAT 

ACATTAAATATATTTATGCGAGTTGCAACTATGCTGGCAACTGGAGGCAACAGAAAGAAATG 

AAGTGACTCAGCTTCTGGCTTCTCTGCTACATCAAATATCTTGTTTAATGGGGCAGATATGC 

ATTAAAT AGTTT GT AC AAG C AG CTTT CG TT GAAG TTT AG AAG AT AAG AAA CATGT CAT CAT A 

TTTAAATGTTCCGGTAATGTGATGCCTCAGGTCTGCCTTTTTTTCTGGAGAATAAATGCAGT 

AATCCTCTCCCAAATAAGCACACACATTTTCAATTCTCATGTTTGAGTGATTTTAAAATGTT 

TTGGTGAATGTGAAAACTAAAGTTTGTGTCATGAGAATGTAAGTCTTTTTT^^ACTTTAAAA 

TTTAGTAGGTTCACTGAGTAACTAAAATTTAGCAAACCTGTGTTTGCATATTTTTTTGGAGT 

GCAGAATATTGTAATTAATGTCATAAGTGATTTGGAGCTTTGGTAAAGGGACCAGAGAGAAG 

GAGTCACCTGCAGTCTTTTGTTTTTTTAAATACTTAGAACTTAGCACTTGTGTTATTGATTA 

GTGAGGAGCCAGTAAGAAACATCTGGGTATTTGGAAACAAGTGGTCATTGTTACATTCATTT 

GCTGAACTTAACAAAACTGTTCATCCTGAAACAGGCACAGGTGATGCATTCTCCTGCTGTTG 

CTTCTCAGTGCTCTCTTTCCAATATAGATGTGGTCATGTTTGACTTGTACAGAATGTTAATC 

ATACAGAGAATCCTTGATGGAATTATATATGTGTGTTTTACTTTTGAATGTTACAAAAGGAA 

ATAACTTTAAAACTATTCTCAAGAGAAAATATTCAAAGCATGAAATATGTTGCTTTTTCCAG 

AATACAAACAGTATACTCATG 
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FIGURE 2 

MLAARLVCLRTLPSRVFHPAFTKAS PWKNS I TKNQWLLT PSRE YATKTR I G IRRGRTGQEL 
KEAALEPSMEKIFKIDQMGRWFVAGGAAVGLGALCYYGLGLSNEIGAIEKAVIWPQYVKDRI 
HSTYMYLxAGSIGLTALSAIAISRTPVLMNFMMRGSWVTIGVTFAAMVGAGMLVRSIPYDQSP 
GPrai^WLLHSGVMGAWAPLTILGGPLLIRAAWYTAGIVGGLSTVAMCAPSEKFLNMGAPL 

.aj GVGLGLVFVSSLGSMFLPPTTVAGATLYSVAMYGGLVLFSMFLLYDTQKVIKRAEVSPMYGV 

t 

$ QKYDP INSMLS I YMDTLNI FMRVATMLATGGNRKK 
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FIGURE 3 

GAAGGCTGCCTCGCTGGTCCGAATTCGGTGGCGCCACGTCCGCCCGTCTCCGCCTTCTGCAT 
CGCGGCTTCGGCGGCTTCCACCTAGACACCTAACAGTCGCGGAGCCGGCCGCGTCGTGAGGG 
GGTCGGCACGGGGAGTCGGGCGGTCTTGTGCATCTTGGCTACCTGTGGGTCGAAGATGTCGG 
ACATCGGAGACTGGTTCAGGAGCATCCCGGCGATCACGCGCTATTGGTTCGCCGCCACCGTC 
,: H GCCGTGCCCTTGGTCGGCAAACTCGGCCTCATCAGCCCGGCCTACCTCTTCCTCTGGCCCGA 
AGCCTTCCTTTATCGCTTTCAGATTTGGAGGCCAATCACTGCCACCTTTTATTTCCCTGTGG 
GTCCAGGAACTGGATTTCTTTATTTGGTCAATTTATATTTCTTATATCAGTATTCTACGCGA 
;$ CTTGAAACAGGAGCTTTTGATGGGAGGCCAGCAGACTATTTATTCATGCTCCTCTTTAACTG 
t GATTTGCATCGTGATTACTGGCTTAGCAATGGATATGCAGTTGCTGATGATTCCTCTGATCA 
S TGTCAGTACTTTATGTCTGGGCCCAGCTGAACAGAGACATGATTGTATCATTTTGGTTTGGA 
ACACGATTTAAGGCCTGCTATTTACCCTGGGTTATCCTTGGATTCAACTATATCATCGGAGG 
CTCGGTAATCAATGAGCTTATTGGAAATCTGGTTGGACATCTTTATTTTTTCCTAATGTTCA 
GATACCCAATGGACTTGGGAGGAAGAAATTTTCTATCCACACCTCAGTTTTTGTACCGCTGG 
CTGCCCAGTAGGAGAGGAGGAGTATCAGGATTTGGTGTGCCCCCTGCTAGCATGAGGCGAGC 
TGCTGATCAGAATGGCGGAGGCGGGAGACACAACTGGGGCCAGGGCTTTCGACTTGGAGACC 
AGTGAAGGGGCGGCCTCGGGCAGCCGCTCCTCTCAAGCCACATTTCCTCCCAGTGCTGGGTG 
CACTTAACAACTGCGTTCTGGCTAACACTGTTGGACCTGACCCACACTGAATGTAGTCTTTC 
AGTACGAGACAAAGTTTCTTAAATCCCGAAGAAAAATATAAGTGTTCCACAAGTTTCACGAT 
TCTCATTCAAGTCCTTACTGCTGTGAAGAACAAATACCAACTGTGCAAATTGCAAAACTGAC 
TACATTTTTTGGTGTCTTCTCTTCTCCCCTTTCCGTCTGAATAATGGGTTTTAGCGGGTCCT 
AATCTGCTGGCATTGAGCTGGGGCTGGGTCACCAAACCCTTCCCAAAAGGACCTTATCTCTT 
TCTTGCACACATGCCTCTCTCCCACTTTTCCCAACCCCCACATTTGCAACTAGAAAAAGTTG 
^ CCCATAAAATTGCTCTGCCCTTGACAGGTTCTGTTATTTATTGACTTTTGCCAAGGCTGGTC 
g£ ACAACAATCATATTCACGTTATTTTCCCCTTTTGGTGGCAGAACTGTTACCAATAGGGGGAG 
"3 AAGACAGCCACGGATGAAGCGTTTCTCAGCTTTTGGAATTGCTTCGACTGACATCCGTTGTT 
' AACCGTTTGCCACTCTTCAGATATTTTTTATAAAAAAAGTACCACTGAGTTCATGAGGGCCA 
CAGATTGGTTATTAATGAGATACGAGGGTTGGTGCTGGGTGTTTGTTTCCTGAGCTAAGTGA 
TCAAGACTGTAGTGGAGTTGCAGCTAACATGGGTTAGGTTTAAACCATGGGGGATGCACCCC 
TTTGCGTTTCATATGTAGCCCTACTGGCTTTGTGTAGCTGGAGTAGTTGGGTTGCTTTGTGT 
TAGGAGGATCCAGATCATGTTGGCTACAGGGAGATGCTCTCTTTGAGAGGTCCTGGGCATTG 
ATTCCCATTTC7VATCTCATTCTGGATATGTGTTCATTGAGTAAAGGAGGAGAGACCCTCATA 
CGCTATTTAAATGTCACTTTTTTGCCTATCCCCCGTTTTTTGGTCATGTTTCAATTAATTGT 
GAGGAAGGCGCAGCTCCTCTCTGCACGTAGATCATTTTTTAAAGCTAATGTAAGCACATCTA 
AGGGAATAACATGATTTAAGGTTGAAATGGCTTTAGAATCATTTGGGTTTGAGGGTGTGTTA 
TTTTGAGTCATGAATGTACAAGCTCTGTGAATCAGACCAGCTTAAATACCCACACCTTTTTT 
TCGTAGGTGGGCTTTTCCTATCAGAGCTTGGCTCATAACC7VAATAAAGTTTTTTGAAGGCCA 
TGGCTTTTCACACAGTTATTTTATTTTATGACGTTATCTGAAAGCAGACTGTTAGGAGCAGT 
ATTGAGTGGCTGTCACACTTTGAGGCAACTAAAAAGGCTTCAAACGTTTTGATCAGTTTCTT 
TTCAGGAAACATTGTGCTCTAACAGTATGACTATTCTTTCCCCCACTCTTAAACAGTGTGAT 
GTGTGTTATCCTAGGAAATGAGAGTTGGCAAACAACTTCTCATTTTGAATAGAGTTTGTGTG 
TACTTCTCCATATTTAATTTATATGATAAAATAGGTGGGGAGAGTCTGAACCTTAACTGTCA 
TG TT TTGT TG TT CAT CTGTGGCCACAATAAAGTTT ACT TGTAAAATTTTAGAGGC CAT TACT 
j*i CCAATTATGTTGCACGTACACTCATTGTACAGGCGTGGAGACTCATTGTATGVATAAGAATA 
TTTCTGACAGTGAGTGACCCGGAGTCTCTGGTGTACCCTCTTACCAGTCAGCTGCCTGCGAG 
CAGTCATTTTTTCCTAAAGGTTTACAAGTATTTAGAACTTTTCAGTTCAGGGCAAAATGTTC 
ATGAAGTTATTCCTCTTAAACATGGTTAGGAAGCTGATGACGTTATTGATTTTGTCTGGATT 
I ATGTTTCTGGAATAATTTTACCAAAACAAGCTATTTGAGTTTTGACTTGACAAGGCAAAACA 
" ! TGACAGTGGATTCTCTTTACAAATGGAAAAAAAAAATCCTTATTTTGTATAAAGGACTTCCC 
TTTTTGTAAACTAATCCTTTTTATTGGTAAAAATTGTAAATTAAAATGTGCAACTTG 



BNSDOCID: <WO .... .9963088A2 . 1. > 



WO 99/63088 




PCT/US99/12252 



FIGURE 4 

MSD I GDWFRS I PA I TRYWFAATVAVPLVGKLGL I S P AYLFLWPEAFL YRFQ I WRP I TATF YF 
PVGPGTGFL YLVNLYFL YQYSTRLE TGAFDGR PADYLFMLLFNW I C I V I TGL AMDMQLLM I P 
LIMSVLYVWAQLNRDMIVSFWFGTRFKACYLPWVILGFNYI IGGSVINELIGNLVGHLYFFL 
MFRYPMDLGGRNFLSTPQFLYRWLPSRRGGVSGFGVPPASMRRAADQNGGGGRHNWGQGFRLGDQ 



n 

v.' 
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FIGURE 5 

GGGGCCGCGGTCTAGGGCGGCTACGTGTGTTGCCATAGCGACCATTTTGCATTAACTGGTTG 

GTAGCTTCTATCCTGGGGGCTGAGCGACTGCGGGCCAGCTCTTCCCCTACTCCCTCTCGGCT 

CCTTGTGGCCCAAAGGCCTAACCGGGGTCCGGCGGTCTGGCCTAGGGATCTTCCCCGTTGCC 

CCTTTGGGGCGGGATGGCTGCGGAAGAAGAAGACGAGGTGGAGTGGGTAGTGGAGAGCATCG 

CGGGGTTCCTGCGAGGCCCAGACTGGTCCATCCCCATCTTGGACTTTGTGGAACAGAAATGT 

GAAGTTAACTGCAAAGGAGGGCATGTGATAACTCCAGGAAGCCCAGAGCCGGTGATTTTGGT 

GGCCTGTGTTCCCCTTGTTTTTGATGATGAAGAAGAAAGCAAATTGACCTATACAGAGATTC 

ATCAGGAATACAAAGAACTAGTTGAAAAGCTGTTAGAAGGTTACCTCAAAGAAATTGGAATT 

AATGAAGATCAATTTCAAGAAGCATGCACTTCTCCTCTTGCAAAGACCCATACATCACAGGC 

CATTTTGCAACCTGTGTTGGCAGCAGAAGATTTTACTATCTTTAAAGCAATGATGGTCCAGA 

AAAACATTGAAATGCAGCTGCAAGCCATTCGAATAATTCAAGAGAGAAATGGTGTATTACCT 

GACTGCTTAACCGATGGCTCTGATGTGGTCAGTGACCTTGAACACGAAGAGATGAAAATCCT 

GAGGGAAGTTCTTAGAAAATCAAAAGAGGAATATGACCAGGAAGAAGAAAGGAAGAGGAAAA 

AACAGTTATCAGAGGCTAAAACAGAAGAGCCCACAGTGCATTCCAGTGAAGCTGCAATAATG 

AATAATTCCCAAGGGGATGGTGAACATTTTGCACACCCACCCTCAGAAGTTAAAATGCATTT 

TGCTAATCAGTCAATAGAACCTTTGGGAAGAAAAGTGGAAAGGTCTGAAACTTCCTCCCTCC 

CACAAAAAGGCCTGAAGATTCCTGGCTTAGAGCATGCGAGCATTGAAGGACCAATAGCAAAC 

TTATCAGTACTTGGAACAGAAGAACTTCGGCAACGAGAACACTATCTCAAGCAGAAGAGAGA 

TAAGTTGATGTCCATGAGAAAGGATATGAGGACTAAACAGATACAAAATATGGAGCAGAAAG 

GAAAACCCACTGGGGAGGTAGAGGAAATGACAGAGAAACCAGAAATGACAGCAGAGGAGAAG 

CAAACATTACTAAAGAGGAGATTGCTTGCAGAGAAACTCAAAGAAGAAGTTATTAATAAGTA 

ATAATTAAGAACAATTTAACAAAATGGAAGTTCAAATTGTCTTAAAAATAAATTATTTAGTC 
CTTACACTG 
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FIGURE 6 

MAAEEEDEVEWVVES I AGFL.RGPDWS I P I LDFVEQKCEVNCKGGHVITPGS PEPVILVACVP 
LVFDDEEESKLTYTEIHQEYKELVEKLLEGYLKEIGINEDQFQEACTSPLAKTHTSQAILQP 
VLAAEDFT I F KAMMVQ KN I EMQLQ A I R 1 1 QERNG VL PD CL.TDGSD WS DLEHEEMK I LRE VL 
RKSKEEYDQEEERKRKKQLSEAKTEEPTVHSSEAAIMNNSQGDGEHFAHPPSEVKMHFANQS 
I E PLGRKVER SETS SL PQ KGLK I PGLEH AS IEGPI ANL SVLGTEELRQREHYLKQKRDKLMS 
MRKDMRTKQIQNMEQKGKPTGEVEEMTEKPEMTAEEKQTLLKRRLLAEKLKEEVINK 
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GGGCACAGCACATGTGAAGTTTTTGATGATGAAGAAGAAAGCAAATTGACCTATACAGAGAT 
TCATCAGGAATACAAAGAACTAGTTGAAAAGCTGTTAGAAGGTTACCTCAAAGAAATTGGAA 
TTAATGAAGATCAATTTCAAGAAGCATGCACTTCTCCTCTTGCAAAGACCCATACATCACAG 
GCCATTTTTGCAACCTGTGTTGGCAGCAGAAGATTTTACTATCTTTAAAGCAATGATGGTCC 
I AGAAAAACATTGAAATGCAGCTGCAAGCCATTCGAATAATTCAAGAGAGAAATGGTGTATTA 
I CC TG ACTG CTT AAC CGATGGCT CTG ATGTGGT CAGT GAC CTTGAACACGAAGAGATGAAAAT 

CCTGAGGGAAGTTCTTAGAAAATCAAAAGAGGAATATGACCAGGAA 



i 
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FIGURE 8 

GCGTGGTTTTTGTTCTGCAATAGGCGGCTTAGAGGGAGGGGCTTTTTCGCCTATACCTACTG 
TAGCTTCTCCACGTATGGACCCTAAAGGCTACTGCTGCTACTACGGGGCTAGACAGTTACTG 
TCTCAGCTCTAGGATGTGCGTTCTTCCACTAGAAGCTCTTCTGAGGGAGGTAATTAAAAAAC 
AGTGGAATGGAAAAACAGTGCTGTAGTCATCCTGTAATATGCTCCTTGTCAACAATGTATAC 
ATTCCTGCTAGGTGCCATATTCATTGCTTTAAGCTCAAGTCGCATCTTACTAGTGAAGTATT 
CTGCCAATGAAGAAAACAAGTATGATTATCTTCCAACTACTGTGAATGTGTGCTCAGAACTG 
GTGAAGCTAGTTTTCTGTGTGCTTGTGTCATTCTGTGTTATAAAGAAAGATCATCAAAGTAG 
AAATTTGAAATATGCTT CCTGG AAGGAATTCTCTGATTTCATGAAGTGGTCCATTCCTGCCT 
TTCTTTATTTCCTGGATAACTTGATTGTCTTCTATGTCCTGTCCTATCTTCAACCAGCCATG 
GCTGTTATCTTCTCAAATTTTAGCATTATAACAACAGCTCTTCTATTCAGGATAGTGCTGAA 
GAG"7GTCTAAACTGGATCCAGTGGGCTTCCCTCCTGACTTTATTTTTGTCTATTGTGGCCT 
TGACTGCCGGGACTAAAACTTTACAGCACAACTTGGCAGGACGTGGATTTCATCACGATGCC 
TTTTTCAGCCCTTCCAATTCCTGCCTTCTTTTCAGAAGTGAGTGTCCCAGAAAAGACAATTG 
T A CAG C AAAGGAATGG ACTT TT C CTGAAG CTAAAT GG AA C AC C A CAGCC AG AGTTTTC AG TC 
ACATCCGTCTTGGCATGGGCCATGTTCTTATTATAGTCCAGTGTTTTATTTCTTCAATGGCT 
AATATCTATAATGAAAAGATACTGAAGGAGGGGAACCAGCTCACTGAAAGCATCTTCATACA 
GAACAGCAAACTCTATTTCTTTGGCATTCTGTTTAATGGGCTGACTCTGGGCCTTCAGAGGA 
GTAACCGTGATCAGATTAAGAACTGTGGATTTTTTTATGGCCACAGTGCATTTTCAGTAGCC 
CTTATTTTTGTAACTGCATTCCAGGGCCTTTCAGTGGCTTTCATTCTGAAGTTCCTGGATAA 
CATGTTCCATGTCTTGATGGCCCAGGTTACCACTGTCATTATCACAACAGTGTCTGTCCTGG 
TCTTTGACTTCAGGCCCTCCCTGGAATTTTTCTTGGAAGCCCCATCAGTCCTTCTCTCTATA 
TTTATTTATAATG C CAG CAAGC CTCAAG TT CC GGAA T A CG CACC T AGG C AAGAAAGGATCCG 
AGATCTAAGTGGCAATCTTTGGGAGCGTTCCAGTGGGGATGGAGAAGAACTAGAAAGACTTA 
CCAAACCCAAGAGTGATGAGTCAGATGAAGATACTTTC TAA CTGGTACCCACATAGTTTGCA 
GCTCTCTTGAACCTTATTTTCACATTTTCAGTGTTTGTAATATTTATCTTTTCACTTTGATA 
AACCAGAAATGTTT CTAAAT CCTAAT ATT CTTTGCATATATCTAGCTACTCCCTAAATGGTT 
CCATCCAAGGCTTAGAGTACCCAAAGGCTAAGAAATTCTAAAGAACTGATACAGGAGTAACA 
ATATGAAGAATTCATTAATATCTCAGTACTTGATAAATCAGAAAGTTATATGTGCAGATTAT 
TTTCCTTGGCCTTCAAGCTTCCAAAAAACTTGTAATAATCATGTTAGCTATAGCTTGTATAT 
ACACATAGAGATCAATTTGCCAAATATTCACAATCATGTAGTTCTAGTTTACATGCCAAAGT 
CTTCCCTTTTTAACATTATAAAAGCTAGGTTGTCTCTTGAATTTTGAGGCCCTAGAGATAGT 
CATTTTGCAAGTAAAGAGCAACGGGACCCTTTCTAAAAACGTTGGTTGAAGG AC CTAAAT AC 
CTGGCCATACCATAGATTTGGGATGATGTAGTCTGTGCTAAATATTTTGCTGAAGAAGCAGT 
TTCTCAGACACAACATCTCAGAATTTTAATTTTTAGAAATTCATGGGAAATTGGATTTTTGT 
AATAATCTTTTGATGTTTTAAACATTGGTTCCCTAGTCACCATAGTTACCACTTGTATTTTA 
AGTCATTTAAACAAGCCACGGTGGGGCTTTTTTCTCCTCAGTTTGAGGAGAAAAATCTTGAT 
GT CATTACT C CTG AATT ATTA CATTT TGGAGAAT AAG AGG GC AT TTT ATT TT AT TAGT TACT 
AATTCAAGCTGTGACTATTGTATATCTTTCCAAGAGTTGAAATGCTGGCTTCAGAATCATAC 
CAGATTGTCAGTGAAGCTGATGCCTAGGAACTTTTAAAGGGATCCTTTCAAAAGGATCACTT 
AGCAAACACATGTTGACTTTTAACTGATGTATGAATATTAATACTCTAAAAATAGAAAGACC 
AGTAATATATAAGTCACTTTACAGTGCTACTTCACACTTAAAAGTGCATGGTATTTTTCATG 
GTATTTTGCATGCAGCCAGTTAACTCTCGTAGATAGAGAAGTCAGGTGATAGATGATATTAA 
AAATTAGCAAACAAAAGTGACTTGCTCAGGGTCATGCAGCTGGGTGATGATAGAAGAGTGGG 
CTTTAACTGGCAGGCCTGTATGTTTACAGACTACCATACTGTAAATATGAGCTTTATGGTGT 
CATTCTCAGAAACTTATACATTTCTGCTCTCCTTTCTCCTAAGTTTCATGCAGATGAATATA 
AGGTAATATACTATTATATAATTCATTTGTGATATCCACAATAATATGACTGGCAAGAATTG 
GTGG AAATTTGT AATT AAAAT AATT ATT AAACCT 
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FIGURE 9 

MEKQCCSHPVICSLSTMYTFLLGAIFIALSSSRILLVKYSANEENKYDYLPTTVNVCSELVK 
LVFCVLVSFCVIKKDHQSRNLKYASWKEFSDFMKWSIPAFLYFLDNLIVFYVLSYLQPAMAV 
IFSNFSI ITTALLFRIVLKRRLNWIQWASLLTLFLS IVALTAGTKTLQHNLAGRGFHHDAFF 
SPSNSCLLFRSECPRKDNCTAKEWTFPEAKWNTTARVFSHIRLGMGHVLIIVQCFISSMANI 
YNEKILKEGNQLTESIFIQNSKLYFFGILFNGLTLGLQRSNRDQIKNCGFFYGHSAFSVALI 
FVTAFQGLSVAFILKFLDNMFHVLMAQVTTVI ITTVSVLVFDFRPSLEFFLEAPSVLLSIFI 
YNASKPQVPEYAPRQERIRDLSGNLWERSSGDGEELERLTKPKSDESDEDTF 
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FIGURE 10 

CGTGCCTGCGCAATGGGTGTCGGGTCCGCTTTTTCCCAATCCGGACGTAATCGTGGTTTTTG 
TTCTGCAATAGGCGGCTTAGAGGGAGGGGCTTTTTCGCCTATACCTACTGTAGCTTCTCCAC 
GTATGGACCCTAAAGGCTACTGCTGCTACTACGGGGCTAGACAGTTACTGTCTCAGCTCTAG 
GATGTGCGTTCTTCCACTAGAAGCTCTTCTGAGGGAGGTAATTAAAAAACAGTGGAATGGAA 
AAACAGTGCTGTAGTCATCCTGTAATATGCTCCTTGTCAACAATGTATACATTCCTGCTAGG 
TGCCATATTCATTGCTTTAAGCTCAAGTCGCATCTTACTAGTGAAGTATTCTGCCAATGAAG 
AAAACAAGTATGATTATCTTCCAACTACTGTGAATGTGTGCTCAGAACTGGTGAAGCTAGTT 
TTCTGTGTGCTTGTGTCATTCTGTGTTATAAAGAAAGATCATCAAAGTAGAAATTTGAAATA 
TGCTTCCTGGAAGGAATTCTCTGATTTCATGAAGTGGTCCATTCCTGCCTTTCTTTATTTCC 
TGGATAACTTGATTGTCTTCTATGTCCTGTCCTATCTTCAACCAGCCATGGCTGTTATCTTC 
TCAAATTTTAGCATTATAACAACAGCTCTTCTATTCAGGATAGTGCTGAAGAGGCGTCTAAA 
CT GGAT C CAGTGGG CTT CCCT C CTGACTTTATTTTTGTCT AT TGTGGCCTTG ACT GCCGGGA 
CTAAAACTTTA 
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FIGURE 1 1 

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGGCCGGCTTGGCTAGCGCGCGGCGGCC 
GTGGCTAAGGCTGCTACGAAGCGAGCTTGGGAGGAGCAGCGGCCTGCGGGGCAGAGGAGCAT 
CCCGTCTACCAGGTCCCAAGCGGCGTGGCCCGCGGGTCATGGCCAAAGGAGAAGGCGCCGAG 
AGCGGCTCCGCGGCGGGGCTGCTACCCACCAGCATCCTCCAAAGCACTGAACGCCCGGCCCA 
GGTGAAGAAAGAACCGAAAAAGAAGAAACAACAGTTGTCTGTTTGCAACAAGCTTTGCTATG 
CACTTGGGGGAGCCCCCTACCAGGTGACGGGCTGTGCCCTGGGTTTCTTCCTTCAGATCTAC 
CTATTGGATGTGGCTCAGGTGGGCCCTTTCTCTGCCTCCATCATCCTGTTTGTGGGCCGAGC 
CTGGGATGCCATCACAGACCCCCTGGTGGGCCTCTGCATCAGCAAATCCCCCTGGACCTGCC 
TGGGTCGCCTTATGCCCTGGATCATCTTCTCCACGCCCCTGGCCGTCATTGCCTACTTCCTC 
ATCTGGTTCGTGCCCGACTTCCCACACGGCCAGACCTATTGGTACCTGCTTTTCTATTGCCT 
CTTTGAAACAATGGTCACGTGTTTCCATGTTCCCTACTCGGCTCTCACCATGTTCATCAGCA 
ACCGAGCAGACTGAGCGGGATTCTGCCACCGCCTATCGGATGACTGTGGAAGTGCTGGGCAC 
AGTGCTGGGCACGGCGATCCAGGGACAAATCGTGGGCCAAGCAGACACGCCTTGTTTCCAGG 
ACTTCAATAGCTCTACAGTAGCTTCACAAAGTGCCAACCATACACATGGCACCACTTCACAC 
AGGGAAACGCAAAAGGCATACCTGCTGGCAGCGGGGGTCATTGTCTGTATCTATATAATCTG 
TGCTGTCATCCTGATCCTGGGCGTGCGGGAGCAGAGAGAACCCTATGAAGCCCAGCAGTCTG 
AGCCAATCGCCTACTTCCGGGGCCTACGGCTGGTCATGAGCCACGGCCCATACATCAAACTT 
ATTACTGGCTTCCTCTTCACCTCCTTGGCTTTCATGCTGGTGGAGGGGAACTTTGTCTTGTT 
TTGCACCTACACCTTGGGCTTCCGCAATGAATTCCAGAATCTACTCCTGGCCATCATGCTCT 
CGGCCACTTTAACCATTCCCATCTGGCAGTGGTTCTTGACCCGGTTTGGCAAGAAGACAGCT 
GTATATGTTGGGATCTCATCAGCAGTGCCATTTCTCATCTTGGTGGCCCTCATGGAGAGTAA 
CCTCATCATTACATATGCGGTAGCTGTGGCAGCTGGCATCAGTGTGGCAGCTGCCTTCTTAC 
TACCCTGGTCCATGCTGCGTGATGTCATTGACGACTTCCATCTGAAGCAGCCCCACTTCCAT 
GGAACCGAGCCCATCTTCTTCTCCTTCTATGTCTTCTTCACCAAGTTTGCCTCTGGAGTGTC 
ACTGGGCATTTCTACCCTCAGTCTGGACTTTGCAGGGTACCAGACCCGTGGCTGCTCGCAGC 
CGGAACGTGTCAAGTTTACACTGAACATGCTCGTGACCATGGCTCCCATAGTTCTCATCCTG 
CTGGGCCTGCTGCTCTTCAAAATGTACCCCATTGATGAGGAGAGGCGGCGGCAGAATAAGAA 
GGCCCTGCAGGCACTGAGGGACGAGGCCAGCAGCTCTGGCTGCTCAGAAACAGACTCCACAG 
AGCTGGCTAGCATCCTCTAGGGCCCGCCACGTTGCCCGAAGCCACCATGCAGAAGGCCACAG 
AAGGGATCAGGACCTGTCTGCCGGCTTGCTGAGCAGCTGGACTGCAGGTGCTAGGAAGGGAA 
CTGAAGACTCAAGGAGGTGGCCCAGGACACTTGCTGTGCTCACTGTGGGGCCGGCTGCTCTG 
TGGCCTCCTGCCTCCCCTCTGCCTGCCTGTGGGGCCAAGCCCTGGGGCTGCCACTGTGAATA 
TGCCAAGGACTGATCGGGCCTAGCCCGGAACACTAATGTAGAAACCTTTTTTTTACAGAGCC 
TAATTAATAACTTAATGACTGTGTACATAGCAATGTGTGTGTATGTATATGTCTGTGAGCTA 
TTAATGTTATTAATTTTCATAAAAGCTGGAAAGC 
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FIGURE 12 

IWLRWALSLPPSSCLWAEPGMPSQTPWWASASANPPGPAWVALiCPGSSSPRPWPSLPTSSSG 
SCPTSHTARPIGTCFSIASLKQWSRVSMFPTRLSPCSSATEQTERDSATAYRMTVEVLGTVL 
GTAIQGQIVGQADTPCFQDFNSSTVASQSANHTHGTTSHRETQKAYLLAAGVIVCIYIICAV 
ILILGVREQREPYEAQQSEPIAYFRGLRLVMSHGPYIKLITGFLFTSLAFMLVEGNFVLFCT 
YTLGFRNE FQNLLLA I MLS ATLT I P I WQWFLTRFGK KT AV YVGI S S AVP FL I L VALMESNL I 
ITYAVAVAAGISVAAAFLLPWSMLPDVIDDFHLKQFHFHGTEPIFFSFYVFFTKFASGVSLG 
I STLSLDFAGYQTRGCSQPERVKFTLNMLVTMAP I VL I LLGLLLFKMYP I DEERRRQNKKAL 
QALRDEASSSGCSETDSTELAS IL 
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FIGURE 13 

GGGAAACGCAAAAGGCATACCTGCTGGCAGCGGGGGTCATTGTCTGTATCTATATAATCTGT 
GCTGTCATCCTGATCCTGGGCGTGCGGGAGCAGAGAGAACCCTATGAAGCCCAGCAGTCTGA 
GCCAATCGCCTACTTCCGGGGCCTACGGCTGGTCATGAGCCACGGCCCATACATCAAACTTA 
TTACTGGCTTCCTCTTCACCTCCTTGGCTTTCATGCTGGTGGAGGGGAACTTTGTCTTGTTT 
TGCACCTACACCTTGGGCTTCCGCAATGAATTCCAGAATCTACTCCTGGCCATCATGCTCTC 
GGCCACTTTAACCATTCCCATCTGGCAGTGGTTCTTGACCCGGTTTGGCAAGAAGACAGCTG 
TATATGTTGGGATCTCATCAGCAGTGCCATTTCTCATCTTGGTGGCCCTCATGGAGAGTAAC 
CTCATCATTACATATGCGGTAGCTGTGGCAGCTGGCATCAGTGTGGCAGCTGCCTTCTTACT 
ACCCTGGTCCATGCTGCCTGATGTCATTGACGACTTCCATCTGAAGCAGCCCCACTTCCATG 
GAACCGAGCCCAT 



BNSDOCID: <WO.__.9963088A2 ..!.._> 



WO 99/63088 ^\ ^\ O PCT/US99/12252 

FIGURE 14 

GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGT 
ATGAGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAAATGT 
GGTGGTTTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCT 
TTCATATTTTCATACATTACTGCAGTAACACTCCACCATATAGACCCGGCTTTACCTTATAT 
CAGTGACACTGGTACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGG 
CAGTTTTATGCATTGCTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCTGAA 
GAGAACGTTATCATCAAATTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGG 
ACTTTCTATTGTGGCAAACTTCCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTG 
TGCTTACCTTTGGTATGGGCTCATTATATATGTTTGTTCAGACCATCCTTTCCTACCAAATG 
CAGCCCAAAATCCATGGCAAACAAGTCTTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGG 
AGTAAGTGCACTTAGCATGCTGACTTGCTCATCAGTTTTGCACAGTGGCAATTTTGGGACTG 
ATTTAGAACAGAAACTCCATTGGAACCCCGAGGACAAAGGTTATGTGCTTCACATGATCACT 
ACTGCAGCAGAATGGTCTATGTGATTTTCCTTCTTTGGTTTTTTCCTGACTTACATTCGTGA 
TTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACATGGATTAACCCTCTATGACACTG 
CACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCAGAGATATTTGATGAAAGGAT 
AAAATATTTCTGTAATGATTATGATTCTCAGGGATTGGGGAAAGGTTCACAGAAGTTGCTTA 
TTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACTGATGAATGCTGATA 
AT CAGGAAACATGAAAGAAG C C AT TTGATAG ATT AT TC T AAAGG AT AT C ATCAAGAAGACTA 
TTAAAAACACCTATGCCTATACTTTTTTATCTCAGAAAATAAAGTCAAAAGACTATG 
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FIGURE 15 

MWWFQQGLSFLPSALV1WTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNI 
AAVLCIATIYWYKQVHTUliSPEENVI IKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSG 
AVLTFGMGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFG 
TDLEQKLHWN PEDKGYVLHM I TTAAEWS MS FS FFGF FLT Y I RDFQK I S LR VEANLHGLTLYD 
TAPCPINNERTRLLSRDI 
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FIGURE 16 

CGGACGCTTGGGCNGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGTGCCTGATGCCGAGT 
TCCGTCTCTCGGGTCTTTTCCTGGTCCCAGGCAAAGCGGAGCGGAGATCCTCAAACGGCCTA 
GTGCTTCGCGCTTCCGGAGAAAATCAGCGGTCTAATTAATTCCTCTGGTTTGTTGAAGCAGT 
TACCAAGAATCTTCAACCCTTTCCCACAAAAGCTAATTGAGTACACGTTCCTGTTGAGTACA 
CGTTCCTGTTGATTTACAAAAGGTGCAGGTATGAGCAGGTCTGAAGACTAACATTTTGTGAA 
GTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGTTTCAGCAAGGCCTCAGTTTCCTTCCT 
TCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTTTCATACATTACTGCAGTAACACT 
CCACCATATAGACCCGGCTTTACCTTATATCAGTGACACTGGTACAGTAJSFC 
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FIGURE 17 

CCCACGCGTCCGCCCGCCGCTGCGTCCCGGAGTGCAAGTGAGCTTCTCGGCTGCCCCGCGGG 
CCGGGGTGCGGAGCCGACATGCGCCCGCTTCTCGGCCTCCTTCTGGTCTTCGCCGGCTGCAC 
CTTCGCCTTGTACTTGCTGTCGACGCGACTGCCCCGCGGGCGGAGACTGGGCTCCACCGAGG 
AGGCTGGAGGCAGGTCGCTGTGGTTCCCCTCCGACCTGGCAGAGCTGCGGGAGCTCTCTGAG 
GTCCTTCGAGAGTACCGGAAGGAGCACCAGGCCTAC GTGTTCCTGCTCTTCTGCGGCGCCTA 
CCTCTACAAACAGGGCTTTGCCATCCCCGGCTCCAGCTTCCTGAATGTTTTAGCTGGTGCCT 
TGTTTGGGCCATGGCTGGGGCTTCTGCTGTGCTGTGTGTTGACCTCGGTGGGTGCCACATGC 
TGCTACCTGCTCTCCAGTATTTTTGGCAAACAGTTGGTGGTGTCCTACTTTCCTGATAAAGT 
GGCCCTGCTGCAGAGAAAGGTGGAGGAGAACAGAAACAGCTTGTTTTTTTTCTTATTGTTTT 
TGAGACTTTTCCCCATGACACCAAACTGGTTCTTGAACCTCTCGGCCCCAATTCTGAACATT 
CCCATCGTGCAGTTCTTCTTCTCAGTTCTTATCGGTTTGATCCCATATAATTTCATCTGTGT 
GCAGACAGGGTCCATCCTGTCAACCCTAACCTCTCTGGATGCTCTTTTCTCCTGGGACACTG 
TCTTTAAGCTGTTGGCCATTGCCATGGTGGCATTAATTCCTGGAACCCTCATTAAAAAATTT 
AGTCAGAAACATCTGCAATTGAATGAAA CAAG TA CT GC TAAT CATATACACAGTAGAAAAGA 
CACATGATCTGGATTTTCTGTTTGCCACATCCCTGGACTCAGTTGCTTATTTGTGTAATGGA 
TGTGGTCCTCTAAAGCCCCTCATTGTTTTTGATTGCCTTCTATAGGTGATGTGGACACTGTG 
CATCAATGTGCAGTGTCTTTTCAGAAAGGACACTCTGCTCTTGAAGGTGTATTACATCAGGT 
TTTCAAACCAGCCCTGGTGTAGCAGACACTGCAACAGATGCCTCCTAGAAAATGCTGTTTGT 
GGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCCGGTGATTC 
ACAAGGTCAGGAGTTCAAGACCAGCCTGGCCAAGATGGTGAAATCCTGTCTCTAATAAAAAT 
ACAAAAATTAGCCAGGCGTGGTGGCAGGCACCTGTAATCCCAGCTACTCGGGAGGCTGAGGC 
AGGAGAATTGCTTGAACCAAGGTGGCAGAGGTTGCAGTAAGCCAAGATCACACCACTGCACT 
CCAGCCTGGGTGATAGAGTGAGACACTGTCTTGAC 
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FIGURE 18 

MRPLLGLLLVFAGCTFALYLLSTRLPRGRRLGSTEEAGGRSLWFPSDLAELRELSEVLREYR 
KEHQAYVFLLFCGAYLYKQGFAIPGSSFIiNVIiAGALFGPWLGLLLCCVLTSVGATCCYLLSS 
I FGKQLVVS YFPD KVALLQRKVEENRNS LF FFLL FLRL F PMT PNWFLNLS AP I LN I P I VQ FF 
FSVLIGL.I PYNF I CVQTGS ILSTLTSLDALFSWDTVFKLLAI AMVALI PGTLIKKFSQKHLQ 
LNETSTANHIHSRKDT 
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FIGURE 19 

CCGAGGCGGGAGGAGCCCGAGGGGGCGCGAGC CCCG CATGAATCATTGTAGTCAATCATTTT 
CCAGTTCTCAGCCGCTCAGTTGTGATCAAGGGACACGTGGTTTCCGAACTGCCAGCTCAGAA 
TAGGAAAATAACTTGGGATTTTATATTGGAAGACATGGATCTTGCTGCCAACGAGATCAGCA 
TTTATGACAAACTTT CAGAGACTGTTGATTTGGTGAGACAGACCGG CC AT CAGTGTGGCATG 
TCAGAGAAGGCAATTGAAAAATTTAT CAGACAGCTGCTGGAAAAGAATGAACCT CAGAGACC 
CCCCCCGCAGTATCCTCTCCTTATAGTTGTGTATAAGGTTCTCGCAACCTTGGGATTAATCT 
TGCTCACTGCCTACTTTGTGATTCAACCTTTCAGCCCATTAGCACCTGAGCCAGTGCTTTCT 
GGAGCTCACACCTGGCGCTCACTCATCCATCACATTAGGCTGATGTCCTTGCCCATTGCCAA 
GAAGTACATGTCAGAAAATAAGGGAGTTCCTCTGCATGGGGGTGATGAAGACAGACCCTTTC 
CAGACTTTGACCCCTGGTGGACAAACGACTGTGAGCAGAATGAGTCAGAGCCCATTCCTGCC 
AACTGCACTGGCTGTGCCCAGAAACACCTGAAGGTGATGCTCCTGGAAGACGCCCCAAGGAA 
ATTTGAGAGG CT C CAT C CACTGGTGATCAAGACGGG AAAG C C CC TGT TGG AGGAAG AGATTC 
AGCATTTTTTGTGCCAGTACCCTGAGGCGACAGAAGGCTTCTCTGAAGGGTTTTTCGCCAAG 
TGGTGGCGCTGCTTTCCTGAGCGGTGGTTCCCATTTCCTTATCCATGGAGGAGACCTCTGAA 
CAGATCACAAATGTTACGTGAGCTTTTTCCTGTTTTCACTCACCTGCCATTTCCAAAAGATG 
CCTCTTTAAACAAGTGCTCCTTTCTTCACCCAGAACCTGTTGTGGGGAGTAAGATGCATAAG 
ATGCCTGACCTATTTATCATTGGCAGCGGTGAGGCCATGTTGCAGCTCATCCCTCCCTTCCA 
GT GC CG AAG AC ATT GT C AGTCTGTGGCC AT G C CAAT AG AG CCAGGGG AT AT CGGCTATGTCG 
AC AC C AC CC ACT GG AAGGT C T ACGTT AT AG C C AGAGGGGTC C AG CCTTTGGTCAT CTG CG AT 
GGAACCGCTTTCTCAGAACTGTAGGAAATAGAACTGTGCACAGGAACAGCTTCCAGAGCCGA 
AAACCAGGTTGAAAGGGGAAAAATAAAAACAAAAACGATGAAACTGCAAAAA 
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FIGURE 20 

MDLAANE I S I YDKLSETVDLVRQTGHQCGMSEKA I E KF I RQLLEKNEPQRPP PQ YPLL I VVY 
KVLATLGLILLTAYFVIQPFSPLAPEPVLSGAHTWRSLIHHIRLMSLPIAKKYMSENKGVPL 
HGGDEDRPFPDFDPWWTNDCEQNESEPIPANCTGCAQKHLKVMLLEDAPRKFERLHPLVIKT 
GKPLLEEEIQHFLCQYPEATEGFSEGFFAKWWRCFPERWFPFPYPWRRPLNRSQMLRELFPV 
FTHLPFPKDASLNKCSFLHPEPWGSKMHKMPDLFI IGSGEAMLQLIPPFQCRRHCQSVAMP 
I E PGD I GYVDTTHWKVYVI ARGVQPLVI CDGTAFS EL 
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FIGURE 21 

CCACGGTGTCCGTTCTTCGCCCGGCGGCAGCTGTCCCCGAGGCGGGAGGAGCCCGAGGGGCG 
CGAGCCCCGCATGAATCATTGTAGTCAATCATTTTCCAGTTCTCAGCCGTTCAGTTGTGATC 
AAGGGACACGTGGTTTCCGAACTGCCAGCTCAGAATAGGAAAATAACTTGGGATTTTATATT 
GGAAGACATGGATCTTGCTGCCAACGAGATCAGCATTTATGACAAACTTTCAGAGACTGTTG 
ATTTGGTG AGAC AG AC CGG C CATCAGTG TGG CAT GT C AGAGAAGGC AATTG AAAAATTTATC 
AGACAGCTGCTGGAAAAGAATGAACCTCAGAGACCCCCCCCGCAGTATCCTCTCCTTATAGT 
TGTGTATAAGGTTCTCGCAACCTTGGGATTAATCTTGCTCACTGCCTACTTTGTGATTCAAC 
CTTTCAGCCCATTAGCACCTGAGCCAGTGCTTTGTGGAGCTCAC 
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FIGURE 22 



CCCACGCGTCCGCCCACGCGTCCGGCTGAACACCTCTTCTTTGGAGTCAGCCACTGATGAGG 
CAGGGTCCCCACTTGCAGCTGCAGCAGCTGCAGCAGCTGCAGAGCGCTGCTCCTGGCTGGTG 
CCACTGGTGCGCACGCTGCTAGACCGTGCCTATGAGCCGCTGGGGCTGCAGTGGGGACTGCC 
CTCCCTGCCACCCACCAATGGCAGCCCCACCTTCTTTGAAGACTTCCAGGCTTTTTGTGCCA 
CACCCGAATGGCGCCACTTCATCGACAAACAGGTACAGCCAACCATGTCCCAGTTCGAAATG 
GACACGTATGCTAAGAGCCACGACCTTATGTCAGGTTTCTGGAATGCCTGCTATGACATGCT 
TATGAGCAGTGGGCAGCGGCGCCAGTGGGAGCGCGCCCAGAGTCGTCGGGCCTTCCAGGAGC 
TGGTGCTGGAACCTGCGCAGAGGCGGGCGCGCCTGGAGGGGCTACGCTACACGGCAGTGCTG 
AAGCAGCAGGCAACGCAGCACTCCATGGCCCTGCTGCACTGGGGGGCGCTGTGGCGCCAGCT 
CGCCAGCCCATGTGGGGCCTGGGCGCTGAGGGACACTCCCATCCCCCGCTGGAAACTGTCCA 
GCGCCGAGACATATTCACGCATGCGTCTGAAGCTGGTGCCCAACCATCACTTCGACCCTCAC 
CTGGAAGCCAGCGCTGTCCGAGACAATCTGGGTGAGGTTCCCCTGACACCCACCGAGGAGGC 
CTCACTGCCTCTGGCAGTGACCAAAGAGGCCAAAGTGAGCACCCCACCCGAGTTGCTGCAGG 
AGGACCAGCTCGGCGAGGACGAGCTGGCTGAGCTGGAGACCCCGATGGAGGCAGCAGAACTG 
GATGAGCAGCGTGAGAAGCTGGTGCTGTCGGCCGAGTGCCAGCTGGTGACGGTAGTGGCCGT 
GGTCCCAGGGCTGCTGGAGGTCACCACACAGAATGTATACTTCTACGATGGCAGCACTGAGC 
GCGTGGAAACCGAGGAGGGCATCGGCTATGATTTCCGGCGCCCACTGGCCCAGCTGCGTGAG 
GTCCACCTGCGGCGTTTCAACCTGCGCCGTTCAGCACTTGAGCTCTTCTTTATCGATCAGGC 
GAACTACTTCCTCAACTTCCCATGCAAGGTGGGCACGACCCCAGTCTCATCTCCTAGCCAGA 
CTCCGAGACCCCAGCCTGGCCCCATCCCACCCCATACCCAGGTACGGAACCAGGTGTACTCG 
TGGCTCCTGCGCCTACGGCCCCCCTCTCAAGGCTACCTAAGCAGCCGCTCCCCCCAGGAGAT 
GCTGCGTGCCTCAGGCCTTACCCAGAAATGGGTACAGCGTGAGATATCCAACTTCGAGTACT 
TGATGCAACTCAACACCATTGCGGGGCGGACCTACAATGACCTGTCTCAGTACCCTGTGTTC 
CCCTGGGTCCTGCAGGACTACGTGTCCCCAACCCTGGACCTCAGCAACCCAGCCGTCTTCCG 
GGACCTGTCTAAGGCCATCGGTGTGGTGAACCCCAAGCATGCCCAGCTCGTGAGGGAGAAGT 
ATGAAAGCTTTGAGGACCCAGCAGGGACCATTGACAAGTTCCACTATGGCACCCACTACTCC 
AATGCAGCAGGCGTGATGCACTACCTCATCCGCGTGGAGCCCTTCACCTCCCTGCACGTCCA 
GCTGCAAAGTGGCCGCTTTGACTGCTCCGACCGGCAGTTCCACTCGGTGGCGGCAGCCTGGC 
AGGCACGCCTGGAGAGCCCTGCCGATGTGAAGGAGCTCATCCCGGAATTCTTCTACTTTCCT 
GACTTCCTGGAGAACCAGAACGGTTTTGACCTGGGCTGTCTCCAGCTGACCAACGAGAAGGT 
AGGCGATGTGGTGCTACCCCCGTGGGCCAGCTCTCCTGAGGACTTCATGCAGCAGCACCGCC 
AGGCTCTGGAGTCGGAGTATGTGTCTGCACACCTACACGAGTGGATCGACCTCATCTTTGGC 
TACAAGCAGCGGGGGCCAGCCGCCGAGGAGGCCCTCAATGTCTTCTATTACTGCACCTATGA 
GGGGGCTGTAGACCTGGACCATGTGACAGATGAGCGGGAACGGAAGGCTCTGGAGGGCATTA 
TCAGCAACTTTGGGCAGACTCCCTGTCAGCTGCTGAAGGAGCCACATCCAACTCGGCTCTCA 
GCTGAGGAAGCAGCCCATCGGCTTGCACGCCTGGACACTAACTCACCTAGCATCTTCCAGCA 
CCTGGACGAACTCAAGGCATTCTTCGCAGAGGTGACTGTGAGTGCCAGTGGGCTGCTGGGCA 
GCCACAGCTGGTTGCCCTATGACCGCAACATAAGCAACTACTTCAGCTTCAGCAAAGACCCC 
AC CAT GGG C AG CC AC AAGACGC AG CGACTGCTG AG TGGCCCGTGGGTGCCAGGCAGTGGTGT 
GAGTGGACAAGCACTGGCAGTGGCCCCGGATGGAAAGCTGCTATTCAGCGGTGGCCACTGGG 
ATGGCAGCCTGCGGGTGACTGCACTACCCCGTGGCAAGCTGTTGAGCCAGCTCAGCTGCCAC 
CTTGATGTAGTAACCTGCCTTGCACTGGACACCTGTGGCATCTACCTCATCTCAGGCTCCCG 
GGACACCACGTGCATGGTGTGGCGGCTCCTGCATCAGGGTGGTCTGTCAGTAGGCCTGGCAC 
CAAAGCCTGTGCAGGTCCTGTATGGGCATGGGGCTGCAGTGAGCTGTGTGGCCATCAGCACT 
GAACTTGACATGGCTGTGTCTGGATCTGAGGATGGAACTGTGATCATACACACTGTACGCCG 
CGGACAGTTTGTAGCGGCACTACGGCCTCTGGGTGCCACATTCCCTGGACCTATTTTCCACC 
TGGCATTGGGGTCCGAAGGCCAGATTGTGGTACAGAGCTCAGCGTGGGAACGTCCTGGGGCC 
CAGGTCACCTACTCCTTGCACCTGTATTCAGTCAATGGGAAGTTGCGGGCTTCACTGCCCCT 
GGCAGAGCAGCCTACAGCCCTGACGGTGACAGAGGACTTTGTGTTGCTGGGCACCGCCCAGT 
GCGCCCTGCACATCCTCCAACTAAACACACTGCTCCCGGCCGCGCCTCCCTTGCCCATGAAG 
GTGGCCATCCGCAGCGTGGCCGTGACCAAGGAGCGCAGCCACGTGCTGGTGGGCCTGGAGGA 
TGGCAAGCTCATCGTGGTGGTCGCGGGGCAGCCCTCTGAGGTGCGCAGCAGCCAGTTCGCGC 
GGAAGCTGTGGCGGTCCTCGCGGCGCATCTCCCAGGTGTCCTCGGGAGAGACGGAATACAAC 
CCTACTGAGGCGCGCTGAACCTGGCCAGTCCGGCTGCTCGGGCCCCGCCCCCGGCAGGCCTG 
GCCCGGGAGGCCCCGCCCAGAAGTCGGCGGGAACACCCCGGGGTGGGCAGCCCAGGGGGTGA 
GCGGGGCCCACCCTGCCCAGCTCAGGGATTGGCGGGCGATGTTACCCCCTCAGGGATTGGCG 
GGCGGAAGTCCCGCCCCTCGCCGGCTGAGGGGCCGCCCTGAGGGCCAGCACTGGCGTCT 
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FIGURE 23 

MSQFEMDTYAKSHDLMSGFWNACYDMLMSSGQRRQWERAQSRRAFQELVLEPAQRRARLEGL 
RYTAVLKQQATQHSMALLHWGALWRQLASPCGAWALRDTPIPRWKLSSAETYSRMRLKLVPN 
HHFDPHLEASALRDNLGEVPLTPTEEASLPLAVTKEAKVSTPPELLQEDQLGEDELAELETP 
MEAAELDEQREKLVLSAECQLVTWAWPGLLEVTTQNVYFYDGSTERVETEEGIGYDFRRP 
LAQLREVHLRRFNLRRSALELFFIDQANYFLNFPCKVGTTPVSSPSQTPRPQPGPIPPHTQV 
RNQVYS WLLRLRPPSQGYL.S SRSPQEMLRASGLTQK WQRE I SNFE YLMQLNTI AGRTYNDL 
SQYPVFPWVLQDYVSPTLDLSNPAVFRDLSKPIGVWPKHAQLVREKYESFEDPAGTIDKFH 
YGTHYSNAAGVMHYLIRVEPFTSLHVgLQSGRFDCSDRQFHSVAAAWQARLESPADVKELIP 
EFFYFPDFLENQNGFDLGCLQLTNEKVGDWLPPWASSPEDFIQQHRQALESEYVSAHLHEW 
IDLIFGYKQRGPAAEEALNVFYYCTYEGAVDLDHVTDERERKALEGI ISNFGQTPCQLLKEP 
HPTRLSAEEAAHRLARLDTNSPSIFQHLDELKAFFAEVTVSASGLLGTHSWLPYDRNISNYF 
SFSKDPTMGSHKTQRLLSGPWVPGSGVSGQALAVAPDGKLLFSGGHWDGSLRVTALPRGKLL 
SQLSCHLDWTCLALDTCGIYLISGSRDTTCMVWRLLHQGGLSVGL.APKPVQVLYGHGAAVS 
CVAISTELDMAVSGSEDGTVIIHTVRRGQFVAALRPLGATFPGPIFHLALGSEGQIWQSSA 
WERPGAQVTYSLHLYSVNGKLRASLPLAEQPTALTVTEDFVLLGTAQCALHILQLNTLLPAA 
P PLPMKVAI RS VAVTKERSHVLVGLEDGKL I VWAGQP SEVRS SQFARKLWRS SRRI SQVS S 
GETEYNPTEAR 
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FIGURE 24 



CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCC 
CACGGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCAT 
CCAAAGGCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTC 

^ TGGACCCTTAACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTT 
CTACTGGGCCTTCCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCC 

I GCACACTCCGTTACCACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAG 
ATAGCCCGGGTCATCTTGGAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGC 
C^GCTGCATCATGTGCTGTTTCAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCC 
TAAACCGCAATGCATACATCATGATCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAA 
AATGCGTTCATGCTACTCATGCGAAACATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGA 
CCTGCTGCTGTTCTTTGGGAAGCTGCTGGTGGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTT 
TTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAGACTTTAAGAGCCCCCACCTCAACTATTAC 
TGGCTGCCCATCATGACCTCCATCCTGGGGGCCTATGTCATCGCCAGCGGCTTCTTCAGCGT 

I TTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCTGGAAGACCTGGAGCGGAACAACG 
GCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAAAGATTCTGGGCAAGAAGAAC 
GAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGGCCCTGATCCAGGACTGC 
ACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGTCTCCATTTTGTGGT 
AAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACTTTGAGAGGCTG 
AGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCTCC 
GTCTCTATTAA^aAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCAGCTAC 

tcgggaggctgaggcaggagaatcgcttgaacccgggaggcagaggttgcagtgagccgaga 
tcgcgccactgcactccaacctgggtgacagactctgtctccaaaacaaaacaaacaaacaa 
aaagatttt att aaagatattttgttaact c 



BNSDOCID: < WO.. _ 9963088A2 I > 



WO 99/63088 f^^XD PCT/US99/12252 

FIGURE 25 

RTRGRTRGGCEKVP INTS CNPTAHLVNS S CPGLMCVFQG YSS KGL I QRSVFNLQ I YGVLGLF 
WTLNWVLALGQCVLAGAFAS F YWAFHKPQD I P TF PL I S AF IRTLRYHTGS LAFGALI L.TLVQ 
IARVILEYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAK 
NAFMLLMRNIVRVWLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYY 
WLPIMTSILGAYVIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKN 
EAPPDNKKRKK 
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FIGURE 26 

GAGTCTTGACCGCCGCCGGGCTCTTGGTACCTCAGCGCGAGCGCCAGGCGTCCGGCCGCCGT 
GGCTATGTTCGTGTCCGATTTCCGCAAAGAGTTCTACGAGGTGGTCCAGAGCCAGAGGGTCC 
TTCTCTTCGTGGCCTCGGACGTGGATGCTCTGTGTGCGTGCAAGATCCTTCAGGCCTTGTTC 
CAGTGTGACCACGTGCAATATACGCTGGTTCCAGTTTCTGGGTGGCAAGAACTTGAAACTGC 
ATTTCTTGAGCATAAAGAACAGTTTCATTATTTTATTCTCATAAACTGTGGAGCTAATGTAG 
ACCTATTGGATATTCTTCAACCTGATGAAGACACTATATTCTTTGTGTGTGACTCCCATAGG 
CCAGTCAATGTCGT CAATGT AT AC AACGAT AC CC AG AT CAAATTACTCATTAAACAAGATGA 
TGACCTTGAAGTTCCCGCCTATGAAGACATCTTCAGGGATGAAGAGGAGGATGAAGAGCATT 
CAGGAAATGACAGTGATGGGTCAGAGCCTTCTGAGAAGCGCACACGGTTAGAAGAGGAGATA 
GTGGAGCAAACCATGCGGAGGAGGCAGCGGCGAGAGTGGGAGGCCCGGAGAAGAGACATCCT 
CTTTGACTACGAGCAGTATGAATATCATGGGACATCGTCAGCCATGGTGATGTTTGAGCTGG 
CTTGGATGCTGTCCAAGGACCTGAATGACATGCTGTGGTGGGCCATCGTTGGACTAACAGAC 
CAGTGGGTGCAAGACAAGATCACTCAAATGAAATACGTGACTGATGTTGGTGTCCTGCAGCG 
CCACGTTTCCCGCCACAACCACCGGAACGAGGATGAGGAGAACACACTCTCCGTGGACTGCA 
CACGGATCTCCTTTGAGTATGACCTCCGCCTGGTGCTCTACCAGCACTGGTCCCTCCATGAC 
AGCCTGTGCAACACCAGCTATACCGCAGCCAGGTTCAAGCTGTGGTCTGTGCATGGACAGAA 
GCGGCTCCAGGAGTTCCTTGCAGACATGGGTCTTCCCCTGAAGCAGGTGAAGCAGAAGTTCC 
AGGCCATGGACATCTCCTTGAAGGAGAATTTGCGGGAAATGATTGAAGAGTCTGCAAATAAA 
TTTGGGAl GAAGGACATGCGCGTGCAGACTTTCAGCATTCATTTTGGGTTCAAGCACAAGTT 
TCTGGCCAGCGACGTGGTCTTTGCCACCATGTCTTTGATGGAGAGCCCCGAGAAGGATGGCT 
CAGGGACAGATCACTTCATCCAGGCTCTGGACAGCCTCTCCAGGAGTAACCTGGACAAGCTG 
TACCATGGCCTGGAACTCGCCAAGAAGCAGCTGCGAGCCACCCAGCAGACCATTGCCAGCTGC 
CTTTGCACCAACCTCGTCATCTCCCAGGGGCCTTTCCTGTACTGCTCTCTCATGGAGGGCAC 
TCCAGATGTCATGCTGTTCTCTAGGCCGGCATCCCTAAGCCTGCTCAGCAAACACCTGCTCA 
AGTCCTTTGTGTGTTCGACAAAGAACCGGCGCTGCAAACTGCTGCCCCTGGTGATGGCTGCC 
CCCCTGAGCATGGAGCATGGCACAGTGACCGTGGTGGGCATCCCCCCAGAGACCGACAGCTC 
GGACAGGAAGAACTTTTTTGGGAGGGCGTTTGAGAAGGCAGCGGAAAGCACCAGCTCCCGGA 
TGCTGCACAACCATTTTGACCTCTCAGTAATTGAGCTGAAAGCTGAGGATCGGAGCAAGTTT 
CTGGACGCACTTATTTCCCTCCTGTCCTAGGAATTTGATTCTTCCAGAATGACCTTCTTATT 
TATGTAACTGGCTTTCATTTAGATTGTAAGTTATGGACATGATTTGAGATGTAGAAGCCATT 
TTTTATTAAATAAAATGCTTATTTTAGGAAA 
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FIGURE 27 

MFVSDFRKEFYEWQSQRVLLFVASDVDALCACKILQALFQCDHVQYTLVPVSGWQELETAF 
LEHKEQFHYFILINCGANVDLLDILQPDEDTIFFVCDSHRPVNWNVYNDTQIKLLIKQDDD 
LEVPAYEDIFRDEEEDEEHSGNDSDGSEPSEKRTRLEEEIVEQTMRRRQRREWEARRRDILF 
D YEQ YE YHGTS S AMVMFE LAWMLS KDLNDMLWWA I VGLTDQWVQDK I TQMKYVTDVG VLQRH 
VSRHNHRNEDEENTLSVDCTRISFEYDLRLVLYQHWSLHDSLCNTSYTAARFKLWSVHGQKR 
LQEFLADMGLPLKQVKQKFQAMDISLKENLREMIEESANKFGMKDMRVQTFSIHFGFKHKFL 
ASDWFATMSLMESPEKDGSGTDHFIQALDSLSRSNLDKLYHGLELAKKQLRATQQTIASCL 
CTNLVISQGPFLYCSLMEGTPDVMLFSRPASLSLLSKHLLKSFVCSTKNRRCKLLPLVMAAP 
LSMEHGTVTWGIPPETDSSDRKNFFGRAFEKAAESTSSRMLHNHFDLSVIELKAEDRSKFL 
DALISLLS 
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FIGURE 28 

GTACCTCAGCGCGAGCGCCAGGCGTCCGGCCGCCGTGGCTATGNTCGTGTCCGATTTCCGCA 
AAGAGTTCTACGAGGTGGTCCAGAGCCAGAGGGTCCTTCTCTTCGTGGCCTCGGANGTGGAT 
GCTCTGTGTGCGTGCAAGATCGTTCAGGCCTTGTTCCAGTGTGACCANGTGCAATATANGCT 
GGTTCCAGTTTCTGGGTGGCAAGAACTTGAAACTGCATTTCTTGAGCATAAAGAACAGTTTC 
ATTATTTTATTCTCATAAACTGTGGAGCTAATG^AGACCTATTGGATATTCTTCAACCTGAT 
GAAGACACTATATTCTTTGTGTGTGACACCCATAGGCCAGTCAATGTTGTCAATGTATACAA 
CGATACCC 
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FIGURE 29 

CAGGAACCCTCTCTTTGGGTCTGGATTGGGACCCCTTTCCAGTACCATTTTTTCTAGTGAAC 
CACGAAGGGACGATACCAGAAAACACCCTCAACCCAAAGGAAATAGACTACAGCCCCAATTG 
GCTGACTTTGGCTATAGAAAAAAGAAAGGAACGAAAAGAGACAGTTTTTTTTGGAAAGCTAA 
GTCTTCCCTTTATCGAGTCAAGAAACCCCCCCTTCTTGAGCTATTTACAGCTTTTAACAATT 
GAGTAAAGTACGCTCCGGTCACCATGGTGACAGCCGCCCTGGGTCCCGTCTGGGCAGCGCTC 
CTGCTCTTTCTCCTGATGTGTGAGATCCGTATGGTGGAGCTCACCTTTGACAGAGCTGTGGC 
CAGCGGCTGCCAACGGTGCTGTGACTCTGAGGACCCCCTGGATCCTGCCCATGTATCCTCAG 
CCTCTTCCTCCGGCCGCCCCCACGCCCTGCCTGAGATCAGACCCTACATTAATATCACCATC 
CTGAAGGGTGACAAAGGGGACCCAGGCCCAATGGGCCTGCCAGGGTACATGGGCAGGGAGGG 
TCCCCAAGGGGAGCCTGGCCCTCAGGGCAGCAAGGGTGACAAGGGGGAGATGGGCAGCCCCG 
GCGCCCCGTGCCAGAAGCGCTTCTTCGCCTTCTCAGTGGGCCGCAAGACGGCCCTGCACAGC 
GGCGAGGACTTCCAGACGCTGCTCTTCGAAAGGGTCTTTGTGAACCTTGATGGGTGCTTTGA 
rzVTGGCGACCGGCCAGTTTGCTGCTCCCCTGCGTGGCATCTACTTCTTCAGCCTCAATGTGC 
ACAGCTGGAATTACAAGGAGACGTACGTGCACATTATGCATAACCAGAAAGAGGCTGTCATC 
CTGTACGCGCAGCCCAGCGAGCGCAGCATCATGCAGAGCCAGAGTGTGATGCTGGACCTGGC 
CTACGGGGACCGCGTCTGGGTGCGGCTCTTCAAGCGCCAGCGCGAGAACGCCATCTACAGCA 
ACGACTTCGACACCTACATCACCTTCAGCGGCCACCTCATCAAGGCCGAGGACGACTGAGGG 
CCTCTGGGCCACCCTCCCGGCTGGAGAGCTCAGGTGCTGGTCCCGTCCCCTGCAGGGCTCAG 
TTTGCACTGCTGTGAAGCAGGAAGGCCAGGGAGGTCCCCGGGGACCTGGCATTCTGGGGAGA 
CCCTGCTTCTATCTTGGCTGCCATCATCCCTCCCAGCCTATTTCTGCTCCTCTCTTCTCTCT 
TGGACCTATTTTAAGAAGCTTGCTAACCTAAATATTCTAGAACTTTCCCAGCCTCGTAGCCC 
AGCACTTCTCAAACTTGGAAATGCATGCGAATCACCCGGGGTTCGTGTTAAATGCAGATTCT 
GACTCAGCAGGTCTGAGTGGGTCCAGGATTCTGTGTTTCTCATATGTTCCTGGGTGATGCTG 
ATGGGGTCAGTCTATGAACCACACTGGAGCAACCAGGTTCTAGGACTTTCTCAATATTCTAG 
TACTTTCTGAACATTCTGGAATCCTCCCCACATTCTAGAATTCTCCCAACATTTTTTTTTCT 
TGAGACAGAGTCTTGCTCTGTTGCCCAGGCTAGAGTGCAGTGGTGCAATCTCAGTTCACTGC 
AACCTCTGCCTCCCGGGTTCAAGCGATTCTTCTGCCTCAGCCTCCCTAGTGGCTGGGATTAC 
AGGCGCCTGCTACCATGCCTGGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATA 
TTGGCCAGGCTGGTCTTGAACTCCTGACTTCAGGTGACCCACCCGCCTCGGCCTCTCAAAAT 
GCTGGGATTACAGGTGTGAGCCACCGTG CCTGGCC AATTCCAACATTCTTAAATTCTCTCAT 
CCCTCCAGGGCTCCCCGTGCTATGTTCTCTTTACCCCTTCCCCCTCTTCTCTTGCTCAGGCC 
TGCACCACTGCAGCCACCGTTCATTTATTCATTCATTAAACACTGAGCACTCACTCTGTGCT 
GGGTCCCGGGAAGGGTGAGGGGGTCAGACACAGGCCCTGCCCCTGCCCTCAGTGACTGGCCA 
GTCCAGCCCAGGCGGGGAGAGATGTGTACATAGGTTTTAAAGCAGACCCAGAGCTCATGGGG 
GCCTGTGTTCTGGGTGTTCAGGTGCTGCTGGTCCTCCATTACCCACTGCTCCCCAAGGCTGG 
TGGGACGGGGTCCCGGTGGCAGGGGCAGGTATCTCCTTCCCGTTCCTCATCCACCTGCCCAG 
TGCTCATCGTTACAGCAAACCCCAGGGGGCCTTGGCCAGGTCAAGGGTTCTGTGAGGAGAGG 
ACCCAGGAGTGTGGGGGCATTTGGGGGGTGAAGTGGCCCCCGAAGAATGGAACCCACACCCA 
TAGCTCTCCCCACAGCTGATACGGCATCCTGCGAGAAGACCTGCCCTCCTCACTGGGATCCC 
CTTCCTGCCTCCTCCCAGGGCTCTGCCAGGGCCTTGCTCAGTCCCTTCCACCAAAGTCATCT 
GAACTTCCGTTTCCCCAGGGCCTCCAGCTGCCCTCAGACACTGATGTCTGTCCCCAGGTGCT 
CTCTGCCCCTCATGCCCCTCTCACCGGCCCAGTGCCCCGACTGTCCAGGCTTTATCAAGGTG 
CTAAGGCCCGCwTGGGCAGCTCCTCGTCTCAGAGCCCTCCTCCGGCCTGGTGCTGCCTTTAC 
AAACACCTGCAGGAGAAGGGCCACGGAAGCCCCAGGCTTTAGAGCCCTCAGCAGGTCTGGGG 
AGCTAGAGCAAAGGAGGGACCTCAGGCCTTCCGTTTCTTCTTCCAGGGTGGGGTGGCCTGGT 
GTTCCCCTAGCCTTCCAAACCCAGGTGGCCTGCCCTTCTCCCCAGAGGGAGGCGGCCTCCGC 
CCATTGGTGCTCATGCAGACTCTGGGGCTGAGGTGCCCCGGGGGGTGATCTCTGGTGCTCAC 
AGCCGAGGGAGCCGTGGCTCCATGGCCAGATGACGGAAACAGGGTCTGACCAAGTGCCAGGA 
AGACCTGTGCTATAAACCACCCTGCCTGATCCTGCCCCTGCCTGACCCCGCCACGCCCTGCC 
GTCCAGCATGATTAAAGAATGCTGTCTCCTCTTGGAAAAAAAAAAAAAAAA 
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^TAALGPWAALLLFLLMCEIRMVELTFDRAVASGCQRCCDSEDPLDPAHVSSASSSGRPH 
ALPEIRPYINITILKGDKGDPGPMGLPGYMGREGPQGEPGPQGSKGDKGEMGSPGAPCQKRF 
FAFSVGRKTALHSGEDFQTLLFERVFWLDGCFDmTGQFAAPLRGIYFFSLWHSWNYKET 
YVHIMHNQKEAVIIiYAQPSERSIMQSQSVMLDLAYGDRVWVRLFKRQRENAIYSNDFDTYIT 
FSGHL I KAEDD 



Important features: 
Signal peptide: 

amino acids 1-20 

N-glycosylation site . 

amino acids 72-75 

Clq domain proteins. 

amino acids 144-178, 78-111 and 84-117 
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ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTG 
AGGCCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTG 
CTCCAGGGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGG 
GCTGCCCATCCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCAGTGCCCGCCAGGGG 
ACCACGGTGCCCCGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCAT 
CACCATGCTCGACGCAGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCAC 
AGAACCAGATCGCCAGCCTGCGCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTG 
GACCTGACGGCCAACAGGCTGCATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCT 
CGAGCGCCTCTACCTGGGCAAGAACCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGC 
TCGACCGCCTCCTGGAGCTCAAGCTGCAGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGC 
CTGCCCCGCCTGCTGCTGCTGGACCTCAGCCACAACAGCCTCCTGGCCCTGGAGCCCGGCAT 
CCTGGACACTGCCAACGTGGAGGCGCTGCGGCTGGCTGGTCTGGGGCTGCAGCAGCTGGACG 
AGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACCTGGATGTGTCCGACAACCAGCTGGAG 
CGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACGCGCCTGCGGCTGGCCGGCAACAC 
CCGCATTGCCCAGCTGCGO^CCGAGGACCTGGCCGGCCTGGCTGCCCTGCAGGAGCTGGATG 
TGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCCTCTTCCCCCGCCTGCGG 
CTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGCTGGTTTGGCCCCTG 
GGTGCGCGAGAGCCACGTGACACTGGCCAGCCCTGAGGAGACGCGCTGCCACTTCCCGGCCA 
AGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAGCCACCACC 
ACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTGTTCTAG 
CTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTCCA 
CTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCGCAGGACTGGGCACCGTCCACCTGCCTC 
AATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTT 
CACGGGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCA 
CGCCGAGGCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGGCCCACCTCCCTGCGC 
GTGGGGCTGCAGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTA 
TCGCAACCTATCGGGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTG 
AGTACACGGTCACCCAGCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGG 
CCCGGGCGGGTGCCGGAGGGCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCA 
CTCCAACCACGCCCCAGTCACCCAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCG 
CCCTGGCCGCGGTGCTCCTGGCCGCGCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGG 
GGGCGGGCCATGGCAGCAGCGGCTCAGGACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCT 
GGAACTGGAGGGAGTGAAGGTCC C CTTGGAG C C AGG CC CG AAGG CAAC AGAGGG CGGTGG AG 
AGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCCACTCATGGGCTTCCCAGGGCCTGGCCTC 
CAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCAGAGAGAGACAGGGCAGCTGGGGCCG 
GGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCCACACCACGTAAGTTCTCAGTCC 
CAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCTGGGCCCTGTTCCCTCTGGA 
CCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCCCTAACGTCCCCAGAAC 
CGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTCCCTGGGCACGGCG 
GGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCACTCCAGGCGGA 
CCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGCGGCTGTG 
TGACTCTAGTCTTGGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGCTTTA 
GGAACATGTTTTGCTTTTTTAAAATATATATATTTATAAGAGATCCTTTCCCATTTATTCTG 
GGAAGATGTTTTTCAAACTCAGAGACAAGGACTTTGGTTTTTGTAAGACAAACGATGATATG 
AAGGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 
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FIGURE 32 

MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFEN 
GITMLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLR 
RLERLYLGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEP 
GILDTANVEALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAG 
NTRIAQLRPEDLAGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFG 
PWVRESHVTLASPEETRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALS 
SSLAPTWLSPTAPATEAPSPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPE 
GFTGLYCESQMGQGTRPSPTPVTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRL 
TYRNLSGPDKRLVTLRLPASLAEYTVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPA 
VHSNHAPVTQAREGNLPLLIAPALAAVLLAALAAVGAAYCWRGRAHAAAAQDKGQVGPGAG 
PLELEGVKVPLEPGPKATEGGGEALPSGSECEVPLMGFPGPGLQSPLHAKPYI 
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GAATCATCCACGCACCTGCAGCTCTGCTGAGAGAGTGCAAGCCGTGGGGGTTTTGAGCTCAT 
CTTCATCATTCATATGAGGAAATAAGTGGTAAAATCCTTGGAAATACAATGAGACTCATCAG 
AAACATTTACATATTTTGTAGTATTGTTATGACAGCAGAGGGTGATGCTCCAGAGCTGCCAG 
AAGAAAGGGAACTGATGACCAAr^GCTCCAACATGTCTCTAAGAAAGGTTCCCGCAGACTTG 
ACCCCAGCCACAACGACACTGGATTTATCCTATAACCTCCTTTTTCAACTCCAGAGTTCAGA 
TTTTCATTCTGTCTCCAAACTGAGAGTTTTGATTCTATGCCATAACAGAATTCAACAGCTGG 
ATCTCAAAACCTTTGAATTCAACAAGGAGTTAAGATATTTAGATTTGTCTAATAACAGACTG 
AAGAGTGTAACTTGGTATTTACTGGCAGGTCTCAGGTATTTAGATCTTTCTTTTAATGACTT 
TGACACCATGCCTATCTGTGAGGAAGCTGGCAACATGTCACACCTGGAAATCCTAGGTTTGA 
GTGGGGCAAAAATACAAAAATCAGATTTCCAGA/AATTGCTCATCTGCATCTAAATACTGTC 
TTCTTAGGATTCAGAACTCTTCCTCATTATGAAGAAGGTAGCCTGCCCATCTTAAACACAAC 
AAAACTGCACATTGTTTTACCAATGGACACAAATTTCTGGGTTCTTTTGCGTGATGGAATCA 
AGACTTCAAAAATATTAGAAATGACAAATATAGATGGCAAAAGCCAATTTGTAAGTTATGAA 
ATGCAACGAAATCTTAGTTTAGA-.^ATGCTAAGACATCGGTTCTATTGCTTAATAAAGTTGA 
TTTACTCTGGGACGACCTTTTCCTTATCTTACAATTTGTTTGGCATACATCAGTGGAACACT 
TTCAGATCCGAAATGTGACTTTTGGTGGTAAGGCTTATCTTGACCACAATTCATTTGACTAC 
TCAAATACTGTAATGAGAACTATAAAATTGGAGCATGTACATTTCAGAGTGTTTTACATTCA 
ACAGGATAAAATCTATTTGCTTTTGACCAAAATGGACATAGAAAACCTGACAATATCAAATG 
CACAAATGCCACACATGCTTTTCCCGAATTATCCTACGAAATTCCAATATTTAAATTTTGCC 
AATAATATCTTAACAGACGAGTTGTTTAAAAGAACTATCCAACTGCCTCACTTGAAAACTCT 
CATTTTGAATGGCAATAAACTGGAGACACTTTCTTTAGTAAGTTGCl'TTGCTAACAACACAC 
CCTTGGAACACTTGGATCTGAGTCAAAATCTATTACAACATAAAAATGATGAAAATTGCTCA 
TGGCCAGAAACTGTGGTCAATATGAATCTGTCATACAATAAATTGTCTGATTCTGTCTTCAG 
GTGCTTGCCCAAAAGTATTCAAATACTTGACCTAAATAATAACCAAATCCAAACTGTACCTA 
AAGAGACTATTCATCTGATGGCCTTACGAGAACTAAATATTGCATTTAATTTTCTAACTGAT 
CTCCCTGGATGCAGTCATTTCAGTAGACTTTCAGTTCTGAACATTGAAATGAACTTCATTCT 
CAGCCCATCTCTGGATTTTGTTCAGAGCTGCCAGGAAGTTAAAACTCTAAATGCGGGAAGAA 
ATCCATTCCGGTGTACCTGTGAATTAAAAAATTTCATTCAGCTTGAAACATATTCAGAGGTC 
ATGATGGTTGGATGGTCAGATTCATACACCTGTGAATACCCTTTAAACCTAAGGGGAACTAG 
GTTAAAAGACGTTCATCTCCACGAATTATCTTGCAACACAGCTCTGTTGATTGTCACCATTG 
TGGTTATTATGCTAGTTCTGGGGTTGGCTGTGGCCTTCTGCTGTCTCCACTTTGATCTGCCC 
TGGTATCTCAGGATGCTAGGTCAATGCACACAAACATGGCACAGGGTTAGGAAAACAACCCA 
AGAACAACTCAAGAGAAATGTCCGATTCCACGCATTTATTTCATACAGTGAACATGATTCTC 
TGTGGGTGAAGAATGAATTGATCCCCAATCTAGAGAAGGAAGATGGTTCTATCTTGATTTGC 
CTTTATGAAAGCTACTTTGACCCTGGCAAAAGCATTAGTGAAAATATTGTAAGCTTCATTGA 
GAAAAGCTATAAGTCCATCTTTGTTTTGTCTCCCAACTTTGTCCAGAATGAGTGGTGCCATT 
ATGAATTCTACTTTGCCCACCACAATCTCTTCCATGAAAATTCTGATCATATAATTCTTATC 
TTACTGGAACCCATTCCATTCTATTGCATTCCCACCAGGTATCATAAACTGAAAGCTCTCCT 
GGAAAAAAAAGCATACTTGGAATGGCCCAAGGATAGGCGTAAATGTGGGCTTTTCTGGGCAA 
ACCTTCGAGCTGCTATTAATGTTAATGTATTAGCCACCAGAGAAATGTATGAACTGCAGACA 
TTCACAGAGTTAAATGAAGAGTCTCGAGGTTCTACAATCTCTCTGATGAGAACAGATTGTCT 
ATAAAATCCCACAGTCCTTGGGAAGTTGGGGACCACATACACTGTTGGGATGTACATTGATA 
CAACCTTTATGATGGCAATTTGACAATATTTATTAAAATAAAAAATGCTTATTCCCTTCATA 
TCAGTTTCTAGAAGGATTTCTAAGAATGTATCCTATAGAAACACCTTCACAAGTTTATAAGG 
GCTTATGGAAAAAGGTGTTCATCCCAGGATTGTTTATAATCATGAAAAATGTGGCCAGGTGC 
AGTGGCTCACTCTTGTAATCCCAGCACTATGGGAGGCCAAGGTGGGTGACCCACGAGGTCAA 
GAGATGGAGACCATCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTA 
GCTGGGCGTGATGGTGCACGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCG 
CTTGAACCCGGGAGGTGGCAGTTGCAGTGAGCTGAGATCGAGCCACTGCACTCCAGCCTGGT 
G AC AGAGCGAGACT C CAT CT CAAAAAAAAG AAAAAAAAAAAAG AAAAAAATGGAAAAC AT CC 
TCATGGCCACAAAATAAGGTCTAATTCAATAAATTATAGTACATTAATGTAATATAATATTA 
CATGCCACTAAAAAGAATAAGGTAGCTGTATATTTCCTGGTATGGAAAAAACATATTAATAT 
GTTATAAACTATTAGGTTGGTGCAAAACTAATTGTGGTTTTTGCCATTGAAATGGCATTGAA 
ATAAAAGTGTAAAGAAATCTATACCAGATGTAGTAACAGTGGTTTGGGTCTGGGAGGTTGGA 
TTACAGGGAGCATTTGATTTCTATGTTGTGTATTTCTATAATGTTTGAATTGTTTAGAATGA 
ATGTGTATTTCTTTTATAAGTAGAAAAAAAATAAAGATAGTTTTTACAGCCT 
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MRLIRNIYIFCSIVMTAEGDAPELPEERELMTNCSNMSLRKVPADLTPATTTLDLSYNLLFQ 
LQSSDFHSVSKLRVLILCHNRIQQLDLKTFEFNKELRYLDLSNNRLKSVTWYLLAGLRYLDL 
SFNDFDTMPICEEAGNMSHLEILGLSGAKIQKSDFQKIAHLHLNTVFLGFRTLPHYEEGSLP 
I LNTTKLH I VLPMDTNFWVLLRDG I KTS K I LEMTN I DGKSQFVS YEMQRNLSLENAKTS VLL 
LNKVDLLWDDLFLILQFWHTSVEHFQIRNVTFGGPCAYLDHNSFDYSNTVMRTIKLEHVHFR 
VFYIQQDKIYLLLTKMDIENLTISNAQMPHMLFPNYPTKFQYLNFANNILTDELFKRTIQLP 
HLKTLILNGNKLETLSLVSCFANNTPLEHLDLSQNLLQHKNDENCSWPETWNMNLSYNKLS 
DS VFRCLPKS IQ I LDLNNNQ IQTVPKL, riHLMALRELNIAFNFLTDLPGCSHFSRLSVLNIE 
MNFILSPSLDFVQSCQEVKTLNAGRNPFRCTCELKNFIQLETYSEVMMVGWSDSYTCEYPLN 
LRGTRLKDVHLHELSCNTALLIVTIVVIMLVLGLAVAFCCLHFDLPWYLRMLGQCTQTWHRV 
RKTTQEQLKRNVRFHAFISYSEHDSLWVKNEL I PNLEKEDGS IL I CLYES YFDPGKS I SEN I 
VS F I EKS YKS I FVLS PNFVQNEWCHYEF YF AHHNLFHENSDH 1 1 L I LLE P I P F YC I PTRYHK 
LKALLEKKAYLEWPKIJRRKCGLFWANLRAAINVNVLATREMYELQTFTELNEESRGSTISLM 
RTDCL 
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FIGURE 35A 

GGGGGCTTTCTTGGGCTTGGCTGCTTGGAACACCTGCCTCCAAGGACCGGCCTCGGAGGGGT 
CGCCGGGAAAGGGAGGGAAGAAGGAAGGGCGGGGCCGGCCCCCCTGCGCCCGCCCCGCGCCT 
CTGCGCGCCCCTGTCCGCCCCGGCCCAGCCCAGCCCAGCCCCGCGGGCCGGTCACACGCGCA 
GCCAGCCGGCCGCCTCCCGCGCCCAAGCGCGCCGCTCTGCTGTGCCCTGCGCCCTTGCCCCG 
CGCCAGCTTCTGCGCCCGCAGCCCGCCCGGCGCCCCCGGTGACCGTGACCCTGCCCTGGGCG 
CGGGGCGGAGCAGGCATGTCCCGCCCGGGGACCGCTACCCCAGCGCTGGCCCTGGTGCTCCT 
GGCAGTGACCCTGGCCGGGGTCGGAGCCCAGGGCGCAGCCCTCGAGGACCCTGATTATTACG 
GGCAGGAGATCTGGAGCCGGGAGCCCTACTACGCGCGCCCGGAGCCCGAGCTCGAGACCTTC 
TCTCCGCCGCTGCCTGCGGGGCCCGGGGAGGAGTGGGAGCGGCGCCCGCAGGAGCCCAGGCC 
GCCCAAGAGGGCCACCAAGCCCAAGAAAGCTCCCAAGAGGGAGAAGTCGGCTCCGGAGCCGC 
CTCCACCAGGTAAACACAGCAACAAAAAAGTTATGAGAACCAAGAGCTCTGAGAAGGCTGCC 
AACGATGATCACAGTGTCCGTGTGGCCCGTGAAGATGTCAGAGAGAGTTGCCCACCTCTTGG 
TCTGGAAACCTTAAAAATCACAGACTTCCAGCTCCATGCCTCCACGGTGAAGCGCTATGGCC 
TGGGGGCACATCGAGGGAGACTCAACATCCAGGCGGGCATTAATGAAAATGATTTTTATGAC 
GGAGCGTGGTGCGCGGGAAGAAATGACCTCCAGCAGTGGATTGAAGTGGATGCTCGGCGCCT 
GACCAGATTCACTGGTGTCATCACTCAAGGGAGGAACTCCCTCTGGCTGAGTGACTGGGTGA 
CATC CTATAAGGTCATGGTGAGCAATGACAG CCA CACGTGGGTCACTGTTAAGAATGGATCT 
GGAGACATGATATTTGAGGGAAACAGTGAGAAGGAGATCCCTGTTCTCAATGAGCTACCCGT 
CCCCATGGTGGCCCGCTACATCCGCATAAACCCTCAGTCCTGGTTTGATAATGGGAGCATCT 
GCATGAGAATGGAGATCCTGGGCTGCCCACTGCCAGATCCTAATAATTATTATCACCGCCGG 
AACGAGATGACCACCACTGATGACCTGGATTTTAAGCACCACAATTATAAGGAAATGCGCCA 
GTTGATGAAAGTTGTGAATGAAATGTGTCCCAATATCACCAGAATTTACAACATTGGAAAAA 
GCCACCAGGGCCTGAAGCTGTATGCTGTGGAGATCTCAGATCACCCTGGGGAGCATGAAGTC 
GGTGAGCCCGAGTTCCACTACATCGCGGGGGCCCACGGCAATGAGGTGCTGGGCCGGGAGCT 
GCTGCTGCTGCTGGTGCAGTTCGTGTGTCAGGAGTACTTGGCCCGGAATGCGCGCATCGTCC 
ACCTGGTGGAGGAGACGCGGATTCACGTCCTCCCCTCCCTCAACCCCGATGGCTACGAGAAG 
GCCTACGAAGGGGGCTCGGAGCTGGGAGGCTGGTCCCTGGGACGCTGGACCCACGATGGAAT 
TGACATCAACAACAACTTTCCTGATTTAAACACGCTGCTCTGGGAGGCAGAGGATCGACAGA 
ATGTCCCCAGGAAAGTTCCCAATCACTATATTGCAATCCCTGAGTGGTTTCTGTGGGAAAAT 
GCCACGGTGGCTGCCGAGACCAGAGCAGTCATAGCCTGGATGGAAAAAATCCCTTTTGTGCT 
GGGCGGCAACCTGCAGGGCGGCGAGCTGGTGGTGGCGTATCCCTACGACCTGGTGCGGTCCC 
CCTGGAAGACGCAGGAACACACCCCCACCCCCGATGACCACGTGTTCCGCTGGCTGGCCTAC 
TCCTATGCCTCCACACACCGCCTCATGACAGACGCCCGGAGGAGGGTGTGCCACACGGAGGA 
CTTCCAGAAGGAGGAGGGCACTGTCAATGGGGCCTCCTGGCACACCGTCGCTGGAAGTCTGA 
ACGATTTCAGCTACCTTCATACAAACTGCTTCGAACTGTCCATCTACGTGGGCTGTGATAAA 
TACCCACATGAGAGCCAGCTGCCCGAO^AGTGGGAGAATAACCGGGAATCTCTGATCGTGTT 
CATGGAGCAGGTTCATCGTGGCATTAAAGGCTTGGTGAGAGATTCACATGGAAAAGGAATCC 
CAAACGCCATTATCTCCGTAGAAGGCATTAACCATGACATCCGAACAGCCAACGATGGGGAT 
TACTGGCGCCTCCTGAACCCTGGAGAGTATGTGGTCACAGCAAAGGCCGAAGGTTTCACTGC 
ATCCACCAAGAACTGTATGGTTGGCTATGACATGGGGGCCACAAGGTGTGACTTCACACTTA 
GCAAAACCAACATGGCCAGGATCCGAGAGATCATGGAGAAGTTTGGGAAGCAGCCCGTCAGC 
CTGCCAGCCAGGCGGCTGAAGCTGCGGGGGCGGAAGAGACGACAGCGTGGGTGACCCTCCTG 
GGCCCTTGAGACTCGTCTGGGACCCATGCAAATTAAACCAACCTGGTAGTAGCTCCATAGTG 
GACTCACTCACTGTTGTTTCCTCTGTAATTCAAGAAGTGCCTGGAAGAGAGGGTGCATTGTG 
AGGCAGGTCCCAAAAGGGAAGGCTGGAGGCTGAGGCTGTTTTCTTTTCTTTGTTGCCATTTA 
TCCAAATAACTTGGACAGAGCAGCAGAGAAAAGCTGATGGGAGTGAGAGAACTCAGCAAGCC 
AACCTGGGAATCAGAGAGAGAAGGAGAAGGAGGGGAGCCTGTCCGTTCAGAGCCTCTGGCTGC 
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FIGURE 35B 



ATAGAAAAGGATTCTGGTGCTTCCCCTGTTTGCGTGGCAGCAAGGGTTCCACGTGCATTTGC 
AATTTGCACAGCTAAAATTGCAGCATTTCCCCAGCTGGGCTGTCCCAAATGTTACCATTTGA 
GATGCTCCCAGGCGTCCTAAGAGAATCCACCCTCTCTGGCCCTGGGACATTGCAAGCTGCTA 
CAAATAAATTCTGTGTTCTTTTGACAATAGCGTCATTGCCAAGTGCACATCAGTGAGCCTCT 
TGAATCTGTTTAGTCTCCTTTTTCAACAAAGGAGTGTGTTCAGAAAAGGAGAGAGAGGCTGA 
GATCATTCAGGAGTTTGTTGGGCAGCAAGCATGGAGCTTCTTGCACAAATTCTGGGTCCATA 
AACAACCCCCAAAGTCCCTGCTGATCCAGTAGCCCTGGAGGTTCCCCAGGTAGGGAGAGCCA 
GAGGTGCCAGCCTTCCTGAAGGGCCAGAAAATTTAGCCTGGATCTCCTCTTTTACCTGCTAG 
GACTGGAAAGAGCCAGAAGTGGGGTGGCCTGAAGCCCTCTCTCTGCTTGAGGTATTGCCCCT 
GTGTGGAATTGAGTGCTCATGGGTTGGCCTCATATCAGCCTGGGAGTTATTTTTGATATGTA 
GAATGCCAGATCTTCCAGATTAGGCTAAATGTAATGAAAACCTCTTAGGATTATCTGTGGAG 
CATCAGTTTGGGAAGAATTATTGAATTATCTTGCAAGAAAAAAGTATGTCTCACTTTTTGTT 
AATGTTGCTGCCTCATTGACCTGGGAAAAATGAAAAAAAAAAATAAAGCAAATGGTAAGACC 
CTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
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FIGURE 36 

MSRPGTATPALALVLLAVTLAGVGAQGAALEDPDYYGQEIWSREPYYARPEPELETFSPPIiP 
AGPGEEWERRPQEPRPPKRATKPKKAPKREKSAPEPPPPGKHSNKKVMRTKSSEKAANDDHS 
VRVAREDVRESCPPLGLETLKITDFQLHASTVKRYGLGAHRGRLNIQAGINENDFYDGAWCA 
GRNDLQQW I EVDARRLTR FTGV I TQGRN SLWLSDWVTS YKVMVSND SHTWVT VKNGSGDM I F 
EGNSEKEIPVLNELPVPMVARYIRINPQSWFDNGS ICMRMEILGCPLPDPNNYYHRRNEMTT 
TDDLDFKHHNYKEMRQLMKWNEMCPNITRIYNIGKSHQGLKLYAVEISDHPGEHEVGEPEF 
HYIAGAHGNEVLGRELLLLLVQFVCQEYLARNARIVHLVEETRIHVLPSLNPDGYEKAYEGG 
SELGGWSLGRWTHDGIDINNNFPDLNTLLWEAEDRQNVPRKVPNHYIAIPEWFLSENATVAA 
ETRAVIAWMEKIPFVLGGNLQGGELWAYPYDLVRSPWKTQEHTPTPDDHVFRWLAYSYAST 
HRLMTDARRRVCHTEDFQKEEGTVNGASWHTVAGSLNDFSYLHTNCFELSIYVGCDKYPHES 
QLPEEWENNRESLIVFMEQVHRGIKGLVRDSHGKGIPNA1 TSVEGINHDIRTANDGDYWRLL 
NPGEYWTAKAEGFTASTKNCrWGYDMGATRCDFTLSKTNMARIREIMEKFGKQPVSLPARR 
LKLRGRKRRQRG 
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CTAAGAGGACAAGATGAGGCCCGGCCTCTCATTTCTCCTAGCCCTTCTGTTCTTCCTTGGCC 

AAGCTGCAGGGGATTTGGGGGATGTGGGACCTCCAATTCCCAGCCCCGGCTTCAGCTCTTTC 

CCAGGTGTTGACTCCAGCTCCAGCTTCAGCTCCAGCTCCAGGTCGGGCTCCAGCTCCAGCCG 

CAGCTTAGGCAGCGGAGGTTCTGTGTCCCAGTTGTTTTCCAATTTCACCGGCTCCGTGGATG 

ACCGTGGGACCTGCCAGTGCTCTGTTTCCCTGCCAGACACCACCTTTCCCGTGGACAGAGTG 

GAACGCTTGGAATTCACAGCTCATGTTCTTTCTCAGAAGTTTGAGAAAGAACTTTCTAAAGT 

GAGGG AATATGT C C AATT AATT AGTG TGTATG AAAAG AAA CTGTTAAA C CT AACTGTC CG AA 

TTGACATCATGGAGAAGGATACCATTTCTTACACTGAACTGGACTTCGAGCTGATCAAGGTA 

GAAGTGAAGGAGATGGAAAAACTGGTCATACAGCTGAAGGAGAGTTTTGGTGGAAGCTCAGA 

AATTGTTGACCAGCTGGAGGTGGAGATAAGAAATATGACTCTCTTGGTAGAG'AAGCTTGAGA 

CACTAGACAAAAACAATGTCCTTGCCATTCGCCGAGAAATCGTGGCTCTGAAGACCAAGCTG 

AAAGAGTGTGAGGCCTCTAAAGATCAAAACACCCCTGTCGTCCACCCTCCTCCCACTCCAGG 

GAGCTGTGGTCATGGTGGTGTGGTGAACATCAGCAAACCGTCTGTGGTTCAGCTCAACTGGA 

GAGGGTTTTCTTATCTATATGGTGCTTGGGGTAGGGATTACTCTCCCCAGCATCCAAACAAA 

GGACTGTATTGGGTGGCGCCATTGAATACAGATGGGAGACTGTTGGAGTATTATAGACTGTA 

CAACACACTGGATGATTTGCTArTGTATATAAATGCTCGAGAGTTGCGGATCACCTATGGCC 

AAGGTAGTGGTACAGCAGTTTACAACAACAACATGTACGTCAACATGTACAACACCGGGAAT 

ATTGCCAGAGTTAACCTGACCACCAACACGATTGCTGTGACTCAAACTCTGCCTAATGCTGC 

CTATAATAACCGCTTTTCATATGCTAATGTTGCTTGGCAAGATATTGACTTTGCTGTGGATG 

AGAATGGATTGTGGGTTATTTATTCAACTGAAGCCAGCAGTGGTAACATGGTGATTAGTAAA 

CTCAATGACACCACACTTCAGGTGCTAAACACTTGGTATACCAAGCAGTATAAACCATCTGC 

TTCTAACGCCTTCATGGTATGTGGGGTTCTGTATGCCACCCGTACTATGAACACCAGAACAG 

AAGAGATTTTTTACTATTATGACACAAACACAGGGAAAGAGGGCAAACTAGACATTGTAATG 

CAT AAG ATGC AGGAAAAAGTGC AGAG CATTAA CT AT AA CC CTTTTG AC CAGAAACTTTATGT 

CTATAACGATGGTTACCTTCTGAATTATGATCTTTCTGTCTTGCAGAAGCCCCAG TAA GCTG 

TTTAGGAGTTAGGGTGAAAGAGAAAATGTTTGTTGAAAAAATAGTCTTCTCCACTTACTTAG 

ATATCTGCAGGGGTGTCTAAAAGTGTGTTCATTTTGCAGCAATGTTTAGGTGCATAGTTCTA 

CCACACTAGAGATCTAGGACATTTGTCTTGATTTGGTGAGTTCTCTTGGGAATCATCTGCCT 

CTTCAGGCGCATTTTGCAATAAAGTCTGTCTAGGGTGGGATTGTCAGAGGTCTAGGGGCACT 

GTGGGCCTAGTGAAGCCTACTGTGAGGAGGCTTCACTAGAAGCCTTAAATTAGGAATTAAGG 

AACTTAAAACTCAGTATGGCGTCTAGGGATTCTTTGTACAGGAAATATTGCCCAATGACTAG 

TCCTCATCCATGTAGCACCACTAATTCTTCCATGCCTGG7VAGAAACCTGGGGACTTAGTTAG 

GTAGATTAATATCTGGAGCTCCTCGAGGGACCAAATCTCCAACTTTTTTTTCCCCTCACTAG 

CACCTGGAATGATGCTTTGTATGTGGCAGATAAGTAAATTTGGCATGCTTATATATTCTACA 

TCTGTAAAGTGCTGAGTTTTATGGAGAGAGGCCTTTTTATGCATTAAATTGTACATGGCAAA 

TAAATCCCAGAAGGATCTGTAGATGAGGCACCTGCTTTTTCTTTTCTCTCATTGTCCACCTT 

ACTAAAAGTCAGTAGAATCTTCTACCTCATAACTTCCTTCCAAAGGCAGCTCAGAAGATTAG 

AACCAGACTTACTAACCAATTCCACCCCCCACCAACCCCCTTCTACTGCCTACTTTAAAAAA 

ATTAATAGTTTTCTATGGAACTGATCTAAGATTAGAAAAATTAATTTTCTTTAATTTCATTA 

TGGACTTTTATTTACATGACTCTAAGACTATAAGAAAATCTGATGGCAGTGACAAAGTGCTA 

GCATTTATTGTTATCTAATAAAGACCTTGGAGCATATGTGCAACTTATGAGTGTATCAGTTG 

TTGCATGTAATTTTTGCCTTTGTTTAAGCCTGGAACTTGTAAGAAAATGAAAATTTAATTTT 

TTTTTCTAGGACGAGCTATAGAAAAGCTATTGAGAGTATCTAGTTAATCAGTGCAGTAGTTG 

GAAACCTTGCTGGTGTATGTGATGTGCTTCTGTGCTTTTGAATGACTTTATCATCTAGTCTT 

TGTCTATTTTTCCTTTGATGTTCAAGTCCTAGTCTATAGGATTGGCAGT1 JAAATGCTTTAC 

TCCCCCTTTTAAAATAAATGATTAAAATGTGCTTTGAAAAAAAAAAAAA 



8NSDOCID: <WO . 9963088A2 l > 



WO 99/63088 PCT/US99/12252 

"5? /3iO 

FIGURE 38 

MRPGLSFLLALLFFLGQAAGDLGDVGPP I PS PGFSS FPGVDSSSSFSSSSRSGS SSSRSLGS 
GGSVSQLFSNFTGSVDDRGTCQCSVSLPDTTFPVDRVERLEFTAHVLSQKFEKELSKVREYV 
QLISVYEKKLLNLTVRIDIMEKDTISYTELDFELIKVEVKEMEKLVIQLKESFGGSSEIVDQ 
LEVEIRNMTLLVEKLETLDKNNVLAIRREIVALKTKLKECEASKDQNTPWHPPPTPGSCGH 
GGWNISKPSWQLNWRGFSYLYGAWGRDYSPQHPNKGLYWVAPLNTDGRLLEYYRLYNTLD 
DLLLYINARELRITYGQGSGTAVYNNl^YVNMYNTGNIARVNLTTNTIAVTQTLPNAAYNNR 
FSYANVAWQDIDFAVDENGLWVIYSTEASTGNMVISKLNDTTLQVLNTWYTKQYKPSASNAF 
MVCGVLYATRTMNTRTEEIFYYYDTNTGKEGKLDIVMHKMQEKVQSINYNPFDQKLYVYNDG 
YLLNYDLSVLQKPQ 
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FIGURE 39 

GCTCTGAAGACCAAGCTGAAAGAGTGTGAGGCCTCTAAAGATCAAACACCCCTGTCGTCCAC 
CCTCCTCCCACTCCAGGGAGCTGTGGTCATGGTGGTGTGGTGAACATCAGCAAACCGTCTGT 
GGTTCAGCTCAACTGGAGAGGGTT. ^ CTTATCTATATGGTGCTTGGGGTAGGGATTACTCTC 
CCCAGCATCCAAACAAAGGNATGTATTGGGNGGCGCCATTGAATACAGATGGGAGACTGTTG 
GAGTATTATAGACTGTACAACCCACTGGATGATTTGCTATTGTATATAAATGCTCGAGAGTT 
GCGGATCACCTATGGCCAAGGTAGTGGTACAGCAG r :TTACAACAACAACATGTACGTCAACA 
TGTACAACACCGGGNATATTGCCAGAGTTAACCTGACC 
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FIGURE 40 

TCTCGCAGATAGTAAATAATCTCGGAAAGGCGAGAAAGAAGCTGTCTCCATCTTGTCTGTAT 
CCGCTGCTCTTGTGACGTTGTGGAGATGGGGAGCGTCCTGGGGCTGTGCTCCATGGCGAGCT 
GG AT AC CATGTTTGTGTGGAAGTGCCCCGTGTTTGCT AT GCCG AT GCTGTCCTAGTGGAAAC 
AACTCCACTGTAACTAGATTGATCTATGCACTTTTCTTGCTTGTTGGAGTATGTGTAGCTTG 
TGTAATGTTGATACCAGGAATGGAAGAACAACTGAATAAGATTCCTGGATTTTGTGAGAATG 
AGAAAGGTGTTGTCCCTTGTAACATTTTGGTTGGCTATAAAGCTGTATATCGTTTGTGCTTT 
GGTTTGGCTATGTTCTATCTTCTTCTCTCTTTACTAATGATCAAAGTGAAGAGTAGCAGTGA 
TCCTAGAGCTGCAGTGCACAATGGATTTTGGTTCTTTAAATTTGCTGCAGCAATTGCAATTA 
TTATTGGGGCATTCTTCATTCCAGAAGGAACTTTTACAACTGTGTGGTTTTATGTAGGCATG 
GCAGGTGCCTTTTGTTTCATCCTCATACAACTAGTCTTACTTATTGATTTTGCACATTCATG 
GAATGAATCGTGGGTTGAAAAAATGGAAGAAGGGAACTCGAGATGTTGGTATGCAGCCTTGT 
TATCAGCTACAGCTCTGAATTATCTGCTGTCTTTAGTTGCTATCGTCCTGTTCTTTGTCTAC 
TACACTCATCCAGGCAGTTGTTCAGAAAACAAGGCGTTCATCAGTGTCAACATGCTCCTCTG 
CGT 3GTGCTTCTGTAATGTCTATACTGCCAAAAATCCAAGAATCACAACCAAGATCTGGTT 
TGTTACAGTCTTCAGTAATTACAGTCTACACAATGTATTTGACATGGTCAGCTATGACCAAT 
GAACCAGAAACAAATTGCAACCCAAGTCTACTAAGCATAATTGGCTACAATACAACAAGCAC 
TGTCCCAAAGGAAGGGCAGTCAGTCCAGTGGTGGCATGCTCAAGGAATTATAGGACTAATTC 
TCTTTTTGTTGTGTGTATTTTATTCCAGCATCCGTACTTCAAACAATAGTCAGGTTAATAAA 
CTGACTCTAACAAGTGATGAATCTACATTAATAGAAGATGGTGGAGCTAGAAGTGATGGATC 
ACTGGAGGATGGGGACGATGTTCACCGAGCTGTAGATAATGAAAGGGATGGTGTCACTTACA 
GTTATTCCTTCTTTCACTTCATGCTTTTCCTGGCTTCACTTTATATCATGATGACCCTTACC 
AACTGGTCCAGGTATGAACCCTCTCGTGAGATGAAAAGTCAGTGGACAGCTGTCTGGGTGAA 
AATCTCTTCCAGTTGGATTGGCATCGTGCTGTATGTTTGGACACTCGTGGCACCACTTGTTC 
TTACAAATCGTGATTTTGACTGAGTGAGACTTCTAGCATGAAAGTCCCACTTTGATTATTGC 
TTATTTGAAAACAGTATTCCCAACTTTTGTAAAGTTGTGTATGTTTTTGCTTCCCATGTAAC 
TTCTCCAGTGTTCTGGCATGAATTAGATTTTACTGCTTGTCATTTTGTTATTTTCTTACCAA 
GTGCATTGATATGTGAAGTAGAATGAATTGCAGAGGAAAGTTTTATGAATATGGTGATGAGT 
TAGTAAAAGTGGCCATTATTGGGCTTATTCTCTGCTCTATAGTTGTGAAATGAAGAGTAAAA 
ACAAATTTGTTTGACTATTTTAAAATTATATTAGACCTTAAGCTGTTTTAGCAAGCATTAAA 
GCAAATGTATGGCTGCCTTTTGAAATATTTGATGTGTTGCCTGGCAGGATACTGCAAAGAAC 
ATGGTTTATTTTAAAATTTATAAACAAGTCACTTAAATGCCAGTTGTCTGAAAAATCTTATA 
AGGTTTTACCCTTGATACGGAATTTACACAGGTAGGGAGTGTTTAGTGGACAATAGTGTAGGTTA 
TGGATGGAGGTGTCGGTACTAAATTGAATAACGAGTAAATAATCTTACTTGGGTAGAGATGG 
CCTTTGCCAACAAAGTGAACTGTTTTGGTTGTTTTAAACTCATGAAGTATGGGTTCAGTGGA 
AATGTTTGGAACTCTGAAGGATTTAGACAAGGTTTTGAAAAGGATAATCATGGGTTAGAAGG 
AAGTGTTTTGAAAGTCACTTTGAAAGTTAGTTTTGGGCCCAGCACGGTAGCTCACCCTTGGT 
AATCCCAGCACTTTGGGAGCTTAAGTGGGTAGATTACTTGAGCCCAGGAATTCAGACCAGCT 
TGGCACATGGTGAACCTGTTCTATAAAAATAATCTGGCTTTGAGCATATGCCTGTGGTCCAG 
CACTGAGAGGCTAGTGAAGATTGCTGAGCCCAGAGCCAAAGGTTGCAGTGAGCAAGTCACGT 
CACTGCACTCTAGCTGGCACAGAGTAAGCCAAAAAAATATATATATATTGAAATCAAGGAGG 
CAAAATTTTGACAGGGAAGGAAGTAACTGCAAAACCACTAGGCTTTAGTAGGTACTTATATA 
AAATCTAGTCCAGTTCTCTCATTTAAAAAAATGAAGACACTGAAATACAGACTTAAATAGCT 
CAGATAGCTAATTAGGAAATTTCAAGTTGGCCAATAATAGCATTCTCTCTGACATTTAAAAA 
TAATTTCTATTCAAAATACATGCATATTGATTTACACCTCATACTGTGATAATTAATGTGAT 
GTGGATTGCTGGTGTCCAGCATGACCCATAAACAGGTCAGAAGAATGATGGAATGTTTTAGA 
ATAAACTCCTGCTTATAGTATACTACACAGTTCAAAAGATGTTTAAAATGCTTTTGTATTTA 
CTGCCATGTAATTGAAATATATAGATTATTGTAACCTTTCAACCTGAAAATCAAGCAGTATG 
AGAGTTTAGTTATTTGTATGTGTCACTAGTGTCTAATGAAGCTTTTAAAATCTACAATTTCT 
TCTTTAAAAATATTTATTAATGTGAATGGAATATAACAATTCAGCTTAATTCCCCAACCTTA 
TTCTGTGTGTAGACATTGTATTCCACAATTTTGAATGGCTGTGTTTTACCTCTAAATAAATG 
AATT C AG AG AAAAAAAAAAAAAAA 
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FIGURE 41 

MGSVLGLCSI^SWI PCLCGSAPCLLCRCCPSGNNSTVTRL I YALFLLVGVCVACVMLI PGME 
EQLNKIPGFCENEKGWPCNILVGYKAVYRLCFGL^FYLLLSLLMIKVKSSSDPRAAVHNG 
FWFFKFAAAIAI IIGAFFIPEGTFTTVWFYVGMAGAFCFILIQLVLLIDFAHSWNESWVEKM 
EEGNSRCWYAALLSATAIjNYLLSLVA I VLFFVYYTHPAS CSENKAF I S VNMLLCVGASVMS I 
LPKIQESQPRSGLLQSSVITVYTMYLTWSAMTNEPETNCNPSLLSIIGYNTTSTVPKEGQSV 
QWWHAQGI IGLILFLLCVFYSSIRTSNNSQVNKLTLTSDESTLIEDGGARSDGSLEDGDDVH 
RAVDNERDGVTYSYSFFHFMLFLASLYIMMTLTNWSRYEPSREMKSQWTAVWVKISSSWIGI 
VL YVWTL VAP L VLTNRD FD 
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FIGURE 42 

GCGAGAAAGAAGCTGTCTCCATCTTGTCTGTATCCCGCTGCTTCTTGNGACGTTGTGGAGAT 
GGGGAGCGTCCCTGGGGCTGTGCTCCATGGCGAGCTGGATACCATGTTTGTGTGGAAGTGCC 
CCGTGTTTGCTATGCCGATGCTGTCCTAGTGGAAACAANTCCACTGTAACTAGATTGATCTA 
TGCACTTTTCTTGCTTGTTGGAGTATGTGTAGCTTGTGTAATGTTGATACCAGGAATGGAAG 
AACAACTGAATAAGATTCCTGGATTTTGTGAGAATGAGAAAGGTGTTGTCCCTTGTAACATT 
TTGGTTGGCTATAAAGCTGTATATCGTTTGTGCTTTGGTTTGGCTATGTTCTATCTTCTTCT 
CTCTTTACTAATGATCAAAGTGAAGAGTAGCAGTGATCCTAGAGCTGCAGTGCACAATGGAT 
TTTGGTTCTTTAAATTTGCTGCAGCAATTGCAATTATTATTGGGGC 
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FIGURE 43 

GTTATTGTGAACTTTGTGGAGATGGGAGGTCNTGGGGCTGTGTTCCATGGCGAGCTGGATAC 
CANGTTTGTGTGGAAGTGCCCCGTGTTTGNTATGCCGATGCTGTCCTAGTGGAAACAANTCC 
ACTGTAATTAGATTGATNTATGCACTTTTNTTGCTTGTTGGAGTANGTGTAGCTTGTGTAAT 
GTTGATACCAGGAATGGAAGAACAACTGAATAAGATTCCTGGATTTTGTGAGAATGAGAAAG 
GTGTTGTCCCTTGTAACATTTTGGTTGGCTATAAAGCTGTATATNGTTTGTGCTTTGGTTTG 
GCTANGTTCTATNTTCTTCTCTCTTTACTAATGATCAAAGTGAAGAGTAGCAGTGATCCTAG 
AGCTGCAGTGCACAATGGATTTTGGTTTTTTAAATTTGCTGCAGCAATTGCAATTATTATTG 
GGGC 
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FIGURE 44 

AAGAAGCTGTCTCCATCTTGTCTGTATCCGCTGCTCTTGTGAACGTTNTGGAGATGGGGAGC 
GTCCTTGGGGTTGTGCTCCATGGCGAGCTGGATACCATGTTTGTGTGGAAGTGCCCCGTGTT 
TGCTATGCCGATGCTGTCCTAGTGGAAACAACTCCACTGTAACTAGATTGATCTATGCACTT 
TTCTTGCTTGTTGGAGTATGTGTAGCTTGTGTAATGTTGATACCAGGAATGGAAGAACAACT 
GAATAAGATTCCTGGATTTTGTGAGAATGAGAAAGGTGTTGTCCCTTGTAACATTTTGGTTG 
GCTATAAAGCTGTATATCGTTTGTGCTTTGGTTTGG^TATGTTCTATCTTCTTCTCTCTTTA 
CTAATGATCAAAGTGAAGAGTAGCAGTGATCCTAGAGCTGCAGTGCACAATGGATTTTGGTT 
CTTTAAATTTGCTGCAGCAATTGCAA'x / ATTATTGGGGC 
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FIGURE 45 

GCTGTCCTTAGTGGAAACAANTCCAACTTGTAACTTGGATTGATCTATGCACTTTTTCCTTG 
CTTGTTGGAGTATGTGTAGCTTTGTGTAATGTTGTTCCCAGGATTGGANGAACAACTGAATA 
AGATTCCTGGATTTTTGTGAGAATGAGAAAGGTGTTGTCCCCTTGTAACATTTTTGGTTGGC 
TATAAAGCTGTATATCGTTTGTGCTTTGGTTTGGCTATGTTCTATCTTCTTCTCTCTTTACT 
AATGATCAAAGTGAAGAGTAGCAGTGATCCTAGAGCTGCAGTGCACAATGGATTTTGGTTCT 
TTAAATTTGCTGCAGCAATTGCAATTATTATTGGGGCATTCTTCATTCCAGAAGGAACTTTT 
ACAACTGTGTGGTTTTATGTAGGCATGGCAGGTGCCTTTTGTTTCATCCTCATACAACTAGT 
CTTACTTATTGATTTTGCACATTCATGG^TGAATCGTGGGTTGAAAAAATGGAAGAAGGGA 
ACTGGAGATGTTGGTATGCAGCCTTGTTATCAGCTACAGCTCTGAATTATCTGCTGTCTTTA 
GTTGCTATCGTCCTGTTCTTTGTCTACTACACTCATCCAGCCAGTTGTTCAGAAAACAAGGC 
GTTCATCAGTGTCAACATGCTCCTCTGCGTTGGTGCTTCTGTAATG 
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FIGURE 46 A 

1 CTCGGGCGCGCACAGGCAGCTCGGTTTGCCCTGCGATTGAGCTGCGGGTCGCGGCCGGCGCC 
'4 GGCCTCTCCAATGGCAAATGTGTGTGGCTGGAGGCGAGCGCGAGGCTTTCGGCAAAGGCAGT 
4 CGAGTGTTTGCAGACCGGGGCGAGTCCTGTGAAAGCAGATAAAAGAAAACATTTATTAACGT 
GTCATTACGAGGGGAGCGCCCGGCCGGGGCTGTCGCACTCCCCGCGGAACATTTGGCTCCCT 
| CCAGCTCCGAGAGAGGAGAAGAAGAAAGCGGAAAAGAGGCAGATTCACGTCGTTTCCAGCCA 
'] AGTGGACCTGATCGATGGCCCTCCTGAATTTATCACGATATTTGATTTATTAGCGATGCCCC 
J CTGGTTTGTGTGTTACGCACACACACGTGCACACAAGGCTCTGGCTCGCTTCCCTCCCTCGT 
1 TTCCAGCTCCTGGGCGAATCCCACATCTGTTTCAACTCTCCGCCGAGGGCGAGCAGGAGCGA 
% GAGTGTGTCGAATCTGCGAGTGAAGAGGGACGAGGGAAAAGAAACAAAGCCACAGACGCAAC 
TTG AGACT C C CG CATC C C AAAAGAAG CACC AG AT C AG C AAAAAAAGAAG ATG GGC C C C CCG A 
^ GCCTCGTGCTGTGCTTGCTGTCCGCAACTGTGTTCTCCCTGCTGGGTGGAAGCTCGGCCTTC 
CTGTCGCACCACCGCCTGAAAGGCAGGTTTCAGAGGGACCGCAGGAACATCCGCCCCAACAT 
;i ; CATCCTGGTGCTGACGGACGACCAGGATGTGGAGCTGGGTTCCATGCAGGTGATGAACAAGA 
% CCCGGCGCATCATGGAGCAGGGCGGGGCGCACTTCATCAACGCCTTCGTGACCACACCCATG 
TGCTGCCCCTCACGCTCCTCCATCCTCACTGGCAAGTACGTCCACAACCACAACACCTACAC 
CAACAATGAGAACTGCTCCTCGCCCTCCTGGCAGGCACAGCACGAGAGCCGCACCTTTGCCG 
TGTACCTCAATAGCACTGGCTACCGGACAGCTTTCTTCGGGAAGTATCTTAATGAATACAAC 
GGCTCCTACGTGCCACCCGGCTGGAAGGAGTGGGTCGGACTCCTTAAAAACTCCCGCTTTTA 
g TAACTACACGCTGTGTCGGAACGGGGTGAAAGAGAAGCACGGCTCCGACTACTCCAAGGATT 
U ACCTCACAGACCTCATCACCAATGACAGCGTGAGCTTCTTCCGCACGTCCAAGAAGATGTAC 
% CCGCACAGGCCAGTCCTCATGGTCATCAGCCATGCAGCCCCCCACGGCCCTGAGGATTCAGC 
j CCCACAATATTCACGCCTCTTCCCAAACGCATCTCAGCACATCACGCCGAGCTACAACTACG 
$ CGCCCAACCCGGACAAACACTGGATCATGCGCTACACGGGGCCCATGAAGCCCATCCACATG 
GAATTCACCAACATGCTCCAGCGGAAGCGCTTGCAGACCCTCATGTCGGTGGACGACTCCAT 
GGAGACGATTTACAACATGCTGGTTGAGACGGGCGAGCTGGACAACACGTACATCGTATACA 
CCG CCGACCACGGTTACC AC AT CGGCCAGTTTGGCCTGGTGAAAGGGAAATCCATGCCAT AT 
GAGTTTGACATCAGGGTCCCGTTCTACGTGAGGGGCCCCAACGTGGAAGCCGGCTGTCTGAA 
TCCCCACATCGTCCTCAACATTGACCTGGCCCCCACCATCCTGGACATTGCAGGCCTGGACA 
TACCTGCGGATATGGACGGGAAATCCATCCTCAAGCTGCTGGACACGGAGCGGCCGGTGAAT 
CGGTTTCACTTGAAAAAGAAGATGAGGGTCTGGCGGGACTCCTTCTTGGTGGAGAGAGGCAA 
GCTGCTACACAAGAGAGACAATGACAAGGTGGACGCCCAGGAGGAGAACTTTCTGCCCAAGT 
ACCAGCGTGTGAAGGACCTGTGTCAGCGTGCTGAGTACCAGACGGCGTGTGAGCAGCTGGGA 
CAGAAGTGGCAGTGTGTGGAGGACGCCACGGGGAAGCTGAAGCTGCATAAGTGCAAGGGCCC 
CATGCGGCTGGGCGGCAGCAGAGCCCTCTCCAACCTCGTGCCCAAGTACTACGGGCAGGGCA 
$ GCGAGGCCTGCACCTGTGACAGCGGGGACTACAAGCTCAGCCTGGCCGGACGCCGGAAAAAA 
CTCTTCAAGAAGAAGTACAAGGCCAGCTATGTCCGCAGTCGCTCCATCCGCTCAGTGGCCAT 
CGAGGTGGACGGCAGGGTGTACCACGTAGGCCTGGGTGATGCCGCCCAGCCCCGAAACCTCA 
| CCAAGCGGCACTGGCCAGGGGCCCCTGAGGACCAAGATGACAAGGATGGTGGGGACTTCAGT 
GGCACTGGAGGCCTTCCCGACTACTCAGCCGCCAACCCCATTAAAGTGACACATCGGTGCTA 
CATCCTAGAGAACGACACAGTCCAGTGTGACCTGGACCTGTACAAGTCCCTGCAGGCCTGGA 
AAGACCACAAGCTGCACATCGACCACGAGATTGAAACCCTGCAGAACAAAATTAAGAACCTG 
AGGGAAGTCCGAGGTCACCTGAAGAAAAAGCGGCCAGAAGAATGTGACTGTCACAAAATCAG 
CTACCACACCCAGCACAAAGGCCGCCTCAAGCACAGAGGCTCCAGTCTGCATCCTTTCAGGA 
AGGGCCTGCAAGAGAAGGACAAGGTGTGGCTGTTGCGGGAGCAGAAGCGCAAGAAGAAACTC 
% CGCAAGCTGCTCAAGCGCCTGCAGAACAACGACACGTGCAGCATGCCAGGCCTCACGTGCTT 
CACCCACGACAACCAGCACTGGCAGACGGCGCCTTTCTGGACACTGGGGCCTTTCTGTGCCT 
GCACCAGCGCCAACAATAACACGTACTGGTGCATGAGGACCATCAATGAGACTCACAATTTC 



BNSDOCID: <WO 9963088A2_.I„> 



WO 99/63088 



PCT/US99/12252 



FIGURE 46B 

CTCTTCTGTGAATTTGCAACTGGCTTCCTAGAGTACTTTGATCTCAACACAGACCCCTACCA 
GCTGATGAATGCAGTGAACACACTGGACAGGGATGTCCTCAACCAGCTACACGTACAGCTCA 
TGGAGCTGAGGAGCTGCAAGGGTT\ ?AAGCAGTGTAACCCCCGGACTCGAAACATGGACCTG 
GATGGAGGAAGCTATGAGCAATACAGGCAGTTTCAGCGTCGAAAGTGGCCAGAAATGAAGAG 
ACCTTCTTCCAAATCACTGGGACAACTGTGGGAAGGCTGGGAAGGTTAAGAAACAACAGAGG 
TGGACCTCCAAAAACATAGAGGCATCACCTGACTGCACAGGCAATGAAAAACCATGTGGGTG 
ATTTCCAGCAGACCTGTGCTATTGGCCAGGAGGCCGAGAAAGCAAGCACGCACTCTCAGTC 
AACATGACAGATTCTGGAGGATAACCAGCAGGAGCAGAGATAACTTCAGGAAGTCCATTTTT 
GCCCCTGCTTTTGCTTTGGATTATACCTCACCAGCTGCACAAAATGCATTTTTTCGTATCAA 
AAAGTCACCAGTAACCCTCCCCCAGAAGCTCACAAAGGAAAACGGAGAGAGCGAGCGAGAGA 
GATTTCCTTGGAAATTTCTCCCAAG^^CGAAAGTCATTGGAATTTTTAAATCATAGGGGAAA 
AGCAGTCCTGTTCTAAATCCTCTTATTCTTTTGGTTTGTCACAAAGAAGGAACTAAGAAGCA 
GGACAGAGGCAACGTGGAGAGGCTGAAAACAGTGCAGAGACGTTTGACAATGAGTCAGTAGC 
ACAAAAGAGATGACATTTACCTAGCACTATAAACCCTGGTTGCCTCTGAAGAAACTGCCTTC 
ATTGTATATATGTGACTATTTACATGTAATCAACATGGGAACTTTTAGGGGAACCTAATAAG 
AAATCCCAATTTTCAGGAGTGGTGGTGTCAATAAACGCTCTGTGGCCAGTGTAAAAGAAAAA 
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FIGURE 47 

MGPPSLVLCLLSATVFSLLGGSSAFLSHHRLKGRFQRDRRNIRPNI ILVLTDDQDVELGSMQ 
VMNKTRRIMEQGGAHFINAFVTTPMCCPSRSS ILTGKYVHNHNTYTNNENCSSPSWQAQHES 
RTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKiaSRFYNYTLCRNGVKEKHGSD 
YSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYSRLFPNASQHITP 
SYOTAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMETIYNMLVETGELDNT 
Y I VYTADHGYH I GQFGLVKGKSMP YE FD IRVP F Y VRG PNVEAGCLNPH I VLNTDLAPT I LD I 
AGLD I PADMDGKS I LKLLDTERPVNRFHLKKKMRVWRDSFLVERGKLLHKRDNDKVDAQEEN 
FLPKYQRVKDLCQRAEYQTACEQLGQKWQCVEDATGKLKLHKCKGPMRLGGSRALSNLVPKY 
YGQGSEACTCDSGDYKLSLAGRRKKLFKKKYKASYVRSRSIRSVAIEVDGRVYHVGLGDAAQ 
PRNLTKRHWPGAPEDQDDKDGGDFSGTGGLPDYSAANP1KVTHRCYILENDTVQCDLDLYKS 
LQAWKDHKLHIDHEIETLQNKIKNLREVRGHLKKKRPEECDCHKISYHTQHKGRLKHRGSSL 
HPFRKGLQEKDKVWLLREQKRKKKLRKLLKRLQNNDTCSMPGLTCFTHDNQHWQTAPFWTLG 
PFCACTSANIOTTYWCMRTINETHNFLFCEFATGFLEYFDLNTDPYQLMNAVNTLDRDVLNQL 
HVQLMELRSCKGYKQCNPRTRNMDLDGGSYEQYRQFQRRKWPEMKRPSSKSLGQLWEGWEG 
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FIGURE 48 

AACAAAGTTCAGTGACTGAGAGGGCTGAGCGGAGGCTGCTGAAGGGGAGAAAGGAGTGAGGA 
GCTGCTGGGCAGAGAGGGACTGTCCGGCTCCCAGATGCTGGGCCTCCTGGGGAGCACAGCCC 
TCGTGGGATGGATCACAGGTGCTGCTGTGGCGGTCCTGCTGCTGCTGCTGCTGCTGGCCACC 
TGCCTTTTCCACGGACGGCAGGACTGTGACGTGGAGAGGAACCGTACAGCTGCAGGGGGAAA 
CCGAGTCCGCCGGGCCCAGCCTTGGCCCTTCCGGCGGCGGGGCCACCTGGGAATCTTTCACC 
ATCACCGTCATCCTGGCCACGTATCTCATGTGCCGAATGTGGGCCTCCACCACCACCACCAC 
CCCCGCCACACCCCTCACCACCTCCACCACCACCACCACCCCCACCGCCACCATCCCCGCCA 
CGCTCGCTGAGGCTGCTGTCGCCGGTGCCTGTGGACAGCAGCTGCCCCTGCCCTCCCATCTG 
TTCCCAGGACAAGTGGACCCCATGTTTCCATGTGGAAGGATGCATCTCTGGGGTGAACGAGG 
GGAACAATAGACTGGGGCTTGCTCCAGCTGCATTTGCATGGCATGCCCCAGTGTACTATGGC 
AGCAGAGAATGGAGGAACACTGGGTCTGCAGTGCTGAAGGGTTTGGGGAGTGGAGAGCAAGG 
GTGCTCTTTCGGGGCTGGACAGCCCGTCTTGTGACAGTGACTCCCAGTGAGCCCCAGAAATG 
ACAAGCGTGTCTTGGCAGAGCCAGCACACAAGTGGATGTGAAGTGCCCGTCTTGACCTCCTC 
ATCAGGCTGCTGCAGGCCTCTGGCGGGCAGGGCACTGGGAGAGGCCCTGAGAATGTCCTTTT 
GGTTTGGAGAAGGCAGTGTGAGGCTGCACAGTCAATTCATCGGTGCCTTAGTCCAAGAAAAT 
AAAAACCACTAAGAAGCTTTAAAAAAAAAAAAAAAAAAAAA 



_._9963088A2..I 



WO 99/63088 . / _^ Q PCT/US99/12252 



0^1 h- 

FIGURE 49 



MLGLLGSTALVGWI TGAAVAVLLLLLLLAT CL FHGRQD CDVERNRTAAGGNRVRRAQP WP FR 
RRGHLGIFHHHRHPGHVSHVPNVGLHHHHHPRHTPHHLHHHHHPHRHHPRHAR 
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FIGURE 50 

GGCGGCTGCTGAGCTGCCTTGAGGTGCAGTGTTGGGGATCCAGAGCCATGTCGGACCTGCTA 
CTACTGGGCCTGATTGGGGGCCTGACTCTCTTACTGCTGCTGACGCTGCTGGCCTTTGCCGG 
GTACTCAGGGCTACTGGCTGGGGTGGAAGTGAGTGCTGGGTCACCCCCCATCCGCAACGTCA 
CTGTGGCCTACAAGTTCCACATGGGGCTCTATGGTGAGACTGGGCGGCTTTTCACTGAGAGC 
TGCAGCATCTCTCCCAAGCTCCGCTCCATCGCTGTCTACTATGACAACCCCCACATGGTGCC 
CCCTGATAAGTGCCGATGTGCCGTGGGCAGCATCCTGAGTGAAGGTGAGGAATCGCCCTCCC 
CTGAGCTCATCGACCTCTACCAGAAATTTGGCTTCAAGGTGTTCTCCTTCCCGGCACCCAGC 
CATGTGGTGACAGCCACCTTCCCCTACACCACCATTCTGTCCATCTGGCTGGCTACCCGGCG 
TGTCCATCCTGCCTTGGACACCTACATCAAGGAGCGGAAGCTGTGTGCCTATCCTCGGCTGG 
AGATCTACCAGGAAGACCAGATCCATTTCATGTGCCCACTGGCACGGCAGGGAGACTTCTAT 
GTGCCTGAGATGAAGGAGACAGAGTGGAAATGGCGGGGGCTTGTGGAGGCCATTGACACCCA 
GGTGGATGGCACAGGAGCTGACACAATGAGTGACACGAGTTCTGTAAGCTTGGAAGTGAGCC 
CTGGCAGCCGGGAGACTTCAGCTGCCACACTGTCACCTGGGGCGAGCAGCCGTGGCTGGGAT 
GACGGTGACACCCGCAGCGAGCACAGCTACAGCGAGTCAGGTGCCAGCGGCTCCTCTTTTGA 
GGAGCTGGACTTGGAGGGCGAGGGGCCCTTAGGGGAGTCACGGCTGGACCCTGGGACTGAGC 
CCCTGGGGACTACCAAGTGGCTCTGGGAGCCCACTGCCCCTGAGAAGGGCAAGGAGTAACCC 
ATGGCCTGCACCCTCCTGCAGTGCAGTTGCTGAGGAACTGAGCAGACTCTCCAGCAGACTCT 
CCAGCCCTCTTCCTCCTTCCTCTGGGGGAGGAGGGGTTCCTGAGGGACCTGACTTCCCCTGC 
TCCAGGCCTCTTGCTAAGCCTTCTCCTCACTGCCCTTTAGGCTCCCAGGGCCAGAGGAGCCA 
GGGACTATTTTCTGCACCAGCCCCCAGGGCTGCCGCCCCTGTTGTGTCTTTTTTTCAGACTC 
ACAGTGGAGCTT C CAGGACC CAGAAT AAAG C C AATG ATTT ACTTGTTTC ACCTGG AAAAAAA 
AAAAAAAAAA 
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FIGURE 51 

MSDLLLLGL I GGLTLLLLLTLLAF AGYS GLLAGVE VS AGS P P I RNVTVAYKFHMGL YGETGR 
LFTESCS I SPKLRS I AVYYDNPHMVPPDKCRCAVGS I LSEGEES P S PEL I DL YQKFGFKVFS 
FPAPSHWTATFPYTT ILS I WLATRRVH PALDTY I KERKL CAYPRLE I YQEDQ I HFMCPLAR 
QGDFYVPEMKETEWKWRGLVEAIDTQVDGTGADTMSDTSSVSLEVSPGSRETSAATLSPGAS 
$ SRGWDDGDTRSEHSYSESGASGSSFEELDLEGEGPL aESRLDPGTEPLGTTKWLWEPTAPEKGKE 
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CCGCGGGAACGCTGTCCTGGCTGCCGCCACCCGAACAGCCTGTCCTGGTGCCCCGGCTCCCT 
GCCCCGCGCCCAGTCATGACCCTGCGCCCCTCACTCCTCCCGCTCCATCTGCTGCTGCTGCT 
GCTGCTCAGTGCGGCGGTGTGCCGGGCTGAGGCTGGGCTCGAAACCGAAAGTCCCGTCCGGA 
CCCTCCAAGTGGAGACCCTGGTGGAGCCCCCAGAACCATGTGCCGAGCCCGCTGCTTTTGGA 
GACACGCTTCACATACACTACACGGGAAGCTTGGTAGATGGACGTATTATTGACACCTCCCT 
GACCAGAGACCCTCTGGTTATAGAACTTGGCCAAAAGCAGGTGATTCCAGGTCTGGAGCAGA 
GTCTTCTCGACATGTGTGTGGGAGAGAAGCGAAGGGCAATCATTCCTTCTCACTTGGCCTAT 
GGAAAACGGGGATTTCCACCATCTGTCCCAGCGGATGCAGTGGTGCAGTATGACGTGGAGCT 
GATTGCACTAATCCGAGCCAACTACTGGCTAAAGCTGGTGAAGGGCATTTTGCCTCTGGTAG 
GGATGGCCATGGTGCCAGCCCTCCTGGGCCTCATTGGGTATCACCTATACAGAAAGGCCAAT 
AGACCCAAAGTCTCCAAAAAGAAGCTCAAGGAAGAGAAACGAAACAAGAGCAAAAAGAAATA 
ATAAATAATAAATTTTAAAAAACTTAAAAAAAAAAAAAAAAAA 
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FIGURE 53 

MTLRPSLLPLHLLLLLLLSAAVCRAEAGLETESPVRTLQVETLVEPPEPCAEPAAFGDTLHI 
HYTGSLVDGR I IDTSLTRDPLVIELGQKQVIPGLEQSLLDMCVGEKRRAI I PSHLAYGKRGF 
P PS VPADAWQ YD VEL I AL I RANYWL KLVKG I LP LVGMAM VP AL LGL I GYHL YRKANRPKVS 
KKKLKEEKRNKSKKK 
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FIGURE 54 

CCCGGGAACGTGTTCCTGGCTGCCGCACCCGAACAGCCTGTCCTGGTGCCCCGGCTCCCTGC 
CCCGCGCCCAGTCATGACCCTGCGCCCCTCACTCCTCCCGCTCCATCTGCTGCTGCTGCTGC 
TGCTCAGTGCGGCGGTGTGCCGGGCTGAGGCTGGGCTCGAAACCGAAAGTCCCGTCCGGACC 
CTCCAAGTGGAGACCCTGGTGGAGCCCCCAGAACCATGTGCCGAGCCCGCTGCTTTTGGAGA 
CACGCTTCACATACACTACACGGGAAGCTTGGTAGATGGACGTATTATTGACACCTCCCTGA 
CCAGAGACCCTCTGGTTATAGAACTTGGCCAAAAGCAGGTGATTCCAGGTCTGGAGCAGAGT 
CTTCTCGACATGTGTGTGGGAGAGAAGCGAAGGGCAATCATTCCTTCTCACTTGGCCTATGG 
AAAACGGGGATTTCCACCATCTGTCCCAGCGGATGCAGTGGTGCAGTATGACGTGGAGCTGA 
TTGCACTAATCCGAGCCAACTACTGGCTAAAGCTGGTGAAGGGCATTTTGCCTCTGGTAGGG 
ATGGCCATGGTGCCACCCTCCTGGGCCTCATTGGGTATCACCTATACAGAAAGGCCAATAGA 
CCCAAAGTCTCCAAAAAGAAGCTCAAGGAAGAGAAACGAAACAAGAGCAAAAAGAAATAATA 
AATAATAAATTTTAAAAAACTTA 
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FIGURE 55 

CCGAAAGTCCCGTCCGGACCCTCCAAGTGGAGACCCTGGTGGAGCCCCCAGAACCATGTGCC 
GAGCCCGCTGCTTTTGGAGACACGCTTCACATACACTACACGGGAAGCTTGGTAGATGGACG 
TATTATTGACACCTCCCTGACCAGAGACCCTCTGGTTATAGAACTTGGCCAAAAGCAGGTGA 
TTCCAGGTCTGGAGCAGAGTCTTCTCGACATGTGTGTGGGAGAGAAGCGAAGGGCAATCATT 
CCTTCTCACTTGGCCTATGGAAAACGGGGATTTCCACCATCTGTCCCAGCGGATGCAGTGGT 
GCAGTATGACGTGGAGCTGATTGCACTAATCCGAGCCAACTACTGGCTAAAGCTGGTGAAGG 
GCATTTTGCCTCTGGTAGGGATGGCCATGGTGCCAGCCCTCCTGGGCCTCATTGGGTATCAC 
CTATACAGAAAGGCCAATAGACCCAAAGTCTCCAAAAAGAAGCTCAAGGAAGAGAAACGAAA 
CAAGAGCAAAAAGAAATAATAAATAATAAATTTTAAAAAACTTAAAA 
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FIGURE 56 

CTGCTGCATCCGGGTGTCTGGAGGCTGTGGCCGTTTTGTTTTCTTGGCTAAAATCGGGGGAG 
TGAGGCGGGCCGGCGCGGCGCGACACCGGGCTCCGGAACCACTGCACGACGGGGCTGGACTG 
ACCTGAAAAAAATGTCTGGATTTCTAGAGGGCTTGAGATGCTCAGAATGCATTGACTGGGGG 
GAAAAGCGCAATACTATTGCTTCCATTGCTGCTGGTGTACTATTTTTTACAGGCTGGTGGAT 
TATCATAGATGCAGCTGTTATTTATCCCACCATGAAAGATTTCAACCACTCATACCATGCCT 
GTGGTGTTATAGCAACCATAGCCTTCCTAATGATTAATGCAGTATCGAATGGACAAGTCCGA 
GGTGATAGTTACAGTGAAGGTTGTCTGGGTCAAACAGGTGCTCGCATTTGGCTTTTCGTTGG 
TTTCATGTTGGCCTTTGGATCTCTGATTGCATCTATGTGGATTCTTTTTGGAGGTTATGTTG 
CTAAAGAAAAAGACATAGTATACCCTGGAATTGCTGTATTTTTCCAGAATGCCTTCATCTTT 
TTTGGAGGGCTGGTTTTTAAGTTTGGCCGCACTGAAGACTTATGGCAGTGAACACATCTGAT 
TTCCCACAGCACAACAGCCCTGCATGGGTTTGTTTGTTTTTTTACTGCTCACTCCCAACCTT 
TTGT7VATGCCATTTTCTAAACTTATTTCTGAGTGTAGTCTCAGCTTAAAGTTGTGTAATACT 
AAAATCACGAGAACACCTAAACAACAACCAAAAATCTATTGTGGTATGCACTTGATTAACTT 
ATAAAATGTTAGAGGAAACTTTCACATGAATAATTTTTGTCAAATTTTATCATGGTATAATT 
TGTAAAAATAAAAAGAAATTACAAAAGAAATTATGGATTTGTCAATGTAAGTATTTGTCATA 
TCTGAGGTCCAAAACCACAATGAAAGTGCTCTGAAGATTTAATGTGTTTATTCAAATGTGGT 
CTCTTCTGTGTCAAATGTTAAATGAAATATAAACATTTTTTAGTTTTTAAAATATTCCGTGG 
TCAAAATTCTTCCTCACTATAATTGGTATTTACTTTTACCAAAAATTCTGTGAACATGTAAT 
GTAACTGGCTTTTGAGGGTCTCCCAAGGGGTGAGTGGACGTGTTGGAAGAGAGAAGCACCAT 
GGTCCAGCCACCAGGCTCCCTGTGTCCCTTCCATGGGAAGGTCTTCCGCTGTGCCTCTCATT 
CCAAGGGCAGGAAGATGTGACTCAGCCATGACACGTGGTTCTGGTGGGATGCACAGTCACTC 
CACATCCACCACTG 
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FIGURE 57 

MSGFLEGLRCSECIDWGEKRNTIASIAAGVLFFTGWWI I IDAAVIYPTMKDFNHSYHACGVI 
ATIAFLMINAVSNGQVRGDSYSEGCLGQTGARIWLFVGFMLAFGSLIASMWILFGGYVAKEK 
DIVYPGIAVFFQNAFIFFGGLVFKFGRTEDLWQ 
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TTCTTGGCTAAAATCGGGGGAGTGAGGCGGGCCGGCGCGGCGCGACACCGGGCTCCGGAACC 
ACTGCACGACGGGGCTGGACTGACCTGAAAAAAATGTCTGGATTTCTAGAGGGCTTGAGATG 
CTCAGAATGCATTGACTGGGGGGAAAAGCGCAATACTATTGCTTCCATTGCTGCTGGTGTAC 
TATTTTTTACAGGCTGGTGGATTATCATAGATGCAGCTGTTATTTATCCCACCATGAAAGAT 
TTCAACCACTCATACCATGCCTGTGGTGTTATAGCAACCATAGCCTTCCTAATGATTAATGC 
AGTATCGAATGGACAAGTCCGAGGTGATAGTTACAGTGAAGGTTGTCTGGGTCAAACAGGTG 
CTCGCATTTGGCTTTTCGTTGGTTTCATGTTGGCCTTTGGATCTCTGATTGCATCTATGTGG 
ATTCTTTTTGGAGGTTATGTTGCTAAAGAAAAAGACATAGTATACCCTGGAATTGCTGTATT 
TTTCCAGAATGCCTTCATCTTTTTTGGAGGGCTGGTTTTTAAGTTTGGC 
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FIGURE 59 

TGGACGGACCTGAAAAAAATGTTTGGATTTNTAGAGGGNTTGAGATGTTCAGAATGCATGAC 
TGGGGGAAAAGCGCAAATACTATTGCTTCCATTGCTGCTGGTGTANTATTTTTTACAGGCTG 
GTGGATTATCATAGATGCAGNTGTTATTTATCCCACCATGAAAGATTTCAACCANTCATACC 
ATGCCTGTGGTGTTATAGCAACCATAGCCTTCNTAATGATTAATGCAGTATCGAATGGACAA 
GTCCGAGGTGATAGTTACAGTGAAGGTTGTTTGGGTCAAACAGGTGCTCGCATTTGGCTTTT 
CGTTGGTTTCATGTTGGCCTTTGGATCTCTGATTGCATCTATGTGGATTCTTTTTGGAGGTT 
ATGTTGCTAAAGAAAAAGACATAGTATACCCTGGAATTGNTGTATTTTTCCAGAATGCCTTC 
ATCTTTTTTGGAGGGCTGGTTTTTAAGTTTGGCCGCACTGAAGANTTATGGCAGTG 
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FIGURE 60 

GGACACCGGGTTCCGGACCAATGCANGACGGGGTGGANTGACCTGAAAAAAATGTTTGGATT 
TTTAGAGGGCTTGAGATGNTCAGAATGCATTGACTGGGGGAAAAGCGCAATANTATTGCTTT 
CCATTGCTGCTGGTGTACTATTTTTTACAGGGTGGTGGATTATCATAGATGCAGCTGTTATT 
TATCCCACCATGAAAGATTTNAACCACTCATACCATGCCTGTGGTGTTATAGCAACCATAGC 
CTTCCTAATGATTAATGCAGTATCGAATGGACAAGTCCGAGGTGATAGTTACAGTGAAGGTT 
GTTTGGGTCAAACAGGTGNTCGCATTTGGCTTTTCGTTGGTTTCATGTTGGCCTTTGGATTT 
CTGATTGNATTCTATGCGGATTCTTCTTGGAGGTTATGTTGCTAAAGAAAAAGACATAGTAT 
ACCCTGGAATTNCTNTATTTTTCCAGAATGCC 
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FIGURE 61 

TAGAGGGCTTGAGATGCTCAGAATGCATTGACTGGGGGGAAAAGCGCAATANTATTGCTTCC 
ATTGNTGNTGGTGTANTATTTTTTTACAGGCTGGTGGATTATNATAGATGCAGCTGTTATTT 
ATCCCACCATGAAAGATTTNAACCANTCATACCATGCCTGTGGTGTTATAGCAACCATAGCC 
TTCCTAATGATTAATGCAGTATNGAATGGACAAGTCCGAGGTGATAGTTACAGTGAAGGTTG 
TTTGGGTCAAACAGGTGNTNGCATTTGGCTTTTNGTTGGTTTCATGTTGGCCTTTGGATCTN 
TGATTGCATTTATGTGGATTNTTTTTGGAGGTTATGTTGCTAAAGNAAAAGACATAGTATAC 
CCTGT 
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FIGURE 62 



GGGAGGCTGTGNCCGTTTTGTTTTNTTGGCTAAAATCGGGGGAGTGAGGCGGCCCGGCGCGG 
CGNGACACCGGGTTCCGGGAACCATTGCACGACGGGGTGGACTGACCTGAAAAAAATGTTTG 
GATTTNTAGAGGGCTTGAGATGCTCAGAATGCATTGACTGGGGGGAAAAGCGCAATACTATT 
GCTTCCATTGCTGCTGGTGTACTATTTTTTACAGGCTGGTGGATTATCATAGATGCAGCTGT 
TATTTATCCCACCATGAAAGATTTCAACCACTCATACCATGCCTGTGGTGTTATAGCAACCA 
TAGCCTTCCTAATGATTAATGCAGTATCGAATGGACAAGTCCGAGGTGATAGTTACAGTGAA 
GGTTGTCTGGGTCAAACAGGTGCTCGCATTTGGCTTTTCGTTGGTTTCATGTTGGCCTTTGG 
ATNTCTGATTGCATCTATGTGGATTCTTTTTGGAGGTTATGTTGCTAAAGAAAAAGACATAG 
TATACCCTGGAATTGCTGTATTTTTCCAGAATGCCTTCATNTTTTTTGGAGGGCTG 
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FIGURE 63 



CGACGCCGGCGTGATGTGGCTTCCGCTGGTGCTGCTCCTGGCTGTGCTGCTGCTGGCCGTCC 

TCTGCAAAGTTTACTTGGGACTATTCT CTGGC AG CTCCCCGAATCCTTTCTCCGAAGATGTC 

AAACGGCCCCCAGCGCCCCTGGTAACTGACAAGGAGGCCAGGAAGAAGGTTCTCAAACAAGC 

TTTTTCAGCCAACCAAGTGCCGGAGAAGCTGGATGTGGTGGTAATTGGCAGTGGCTTTGGGG 

GCCTGGCTGCAGCTGCAATTCTAGCTAAAGCTGGCAAGCGAGTCCTGGTGCTGGAACAACAT 

ACCAAGGCAGGGGGCTGCTGTCATACCTTTGGAAAGAATGGCCTTGAATTTGACACAGGAAT 

CCATTACATTGGGCGTATGGAAGAGGGCAGCATTGGCCGTTTTATCTTGGACCAGATCACTG 

AAGGGCAGCTGGACTGGGCTCCCCTGTCCTCTCCTTTTGACATCATGGTACTGGAAGGGCCC 

AATGGCCG AAAGGAGTAC C C CATGTACAGT GGAG AGAAAG CCTACATTCAGGGC CT CAAGG A 

GAAGTTTCCACAGGAGGAAGCTATCATTGACAAGTATATAAAGCTGGTTAAGGTGGTATCCA 

GTGGAGCCCCTCATGCCATCCTGTTGAAATTCCTCGCATTGCCCGTGGTTCAGCTCCTCGAC 

AGGTGTGGGCTGCTGACTCGTTTCTCTCCATTCCTTCAAGCATCCACCCAGAGCCTGGCTGA 

G ^TCCT G CAG C AGC TGGGGGCCT CCTCTG AG CT C C AGG C AG T AC TCAGCTAC AT CTTCC CCA 

CTTACGGTGTCACCCCCAACCACAGTGCCTTTTCCATGCACGCCCTGCTGGTCAACCACTAC 

ATGAAAGGAGG CTTTTATCCCCGAGGGGGTTCC AG TG AAATTGCCTTCCACACCATCCCTGT 

GATTCAGCGGGCTGGGGGCGCTGTCCTCACAAAGGCCACTGTGCAGAGTGTGTTGCTGGACT 

CAGCTGGGAAAGCCTGTGGTGTCAGTGTGAAGAAGGGGCATGAGCTGGTGAACATCTATTGC 

CCCATCGTGGTCTCCAACGCAGGACTGTTCAACACCTATGAACACCTACTGCCGGGGAACGC 

CCGCTGCCTGCCAGGTGTGAAGCAGCAACTGGGGACGGTGCGGCCCGGCTTAGGCATGACCT 

CTGTTTTCATCTGCCTGCGAGGCACCAAGGAAGACCTGCATCTGCCGTCCACCAACTACTAT 

GTTTACT AT G AC ACGGA CATGG AC C AGGCG AT GG AG CGCT AC GTCTCCATGCCCAGGGAAGA 

GGCTGCGGAACACATCCCTCTTCTCTTCTTCGCTTTCCCATCAGCCAAAGATCCGACCTGGG 

AGGACCGATTCCCAGGCCGGTCCACCATGATCATGCTCATACCCACTGCCTACGAGTGGTTT 

GAGGAGTGGCAGGCGGAGCTGAAGGGAAAGCGGGGCAGTGACTATGAGACCTTCAAAAACTC 

CTTTGTGGAAGCCTCTATGTCAGTGGTCCTGAAACTGTTCCCACAGCTGGAGGGGAAGGTGG 

AGAGTGTGACTGCAGGATCCCCACTCACCAACCAGTTCTATCTGGCTGCTCCCCGAGGTGCC 

TGCTACGGGGCTGACCATGACCTGGGCCGCCTGCACCCTTGTGTGATGGCCTCCTTGAGGGC 

CCAGAGCCCCATCCCCAACCTCTATCTGA'CAGGCCAGGATATCTTCACCTGTGGACTGGTCG 

GGGCCCTGCAAGGTGCCCTGCTGTGCAGCAGCGCCATCCTGAAGCGGAACTTGTACTCAGAC 

CTTAAGAATCTTGATTCTAGGATCCGGGCACAGAAGAAAAAGAATTAGTTCCATCAGGGAGG 

AGTCAGAGGAATTTGCCCAATGGCTGGGGCATCTCCCTTGACTTACCCATAATGTCTTTCTG 

CATTAGTTCCTTGCACGTATAAAGCACTCTAATTTGGTTCTGATGCCTGAAGAGAGGCCTAG 

TTTAAATCACAATTCCGAATCTGGGGCAATGGAATCACTGCTTCCAGCTGGGGCAGGTGAGA 

TCTTTACGCCTTTTATAACATGCCATCCCTACTAATAGGATATTGACTTGGATAGCTTGATG 

TCTCATGACGAGCGGCGCTCTGCATCCCTCACCCATGCCTCCTAACTCAGTGATCAAAGCGA 

ATATTCCATCTGTGGATAGAACCCCTGGCAGTGTTGTCAGCTCAACCTGGTGGGTTCAGTTC 

TGTCCTGAGGCTTCTGCTCTCATTCATTTAGTGCTACGCTGCACAGTTCTACACTGTCAAGG 

GAAAAGGGAGACTAATGAGGCTTAACTCAAAACCTGGGCGTGGTTTTGGTTGCCATTCCATA 

GGTTTGGAGAGCTCTAGATCTCTTTTGTGCTGGGTTCAGTGGCTCTTCAGGGGACAGGAAAT 

GCCTGTGTCTGGCCAGTGTGGTTCTGGAGCTTTGGGGTAACAGCAGGATCCATCAGTTAGTA 

GGGTGCATGTCAGATGATCATATCCAATTCATATGGAAGTCCCGGGTCTGTCTTCCTTATCA 

TCGGGGTGGCAGCTGGTTCTCAATGTGCCAGCAGGGACTCAGTACCTGAGCCTCAATCAAGC 

CTTATCCACCAAATACACAGGGAAGGGTGATGCAGGGAAGGGTGACATCAGGAGTCAGGGCA 

TGGACTGGTAAGATGAATACTTTGCTGGGCTGAAGCAGGCTGCAGGGCATTCCAGCCAAGGG 

CACAGCAGGGGACAGTGCAGGGAGGTGTGGGGTAAGGGAGGGAAGTCACATCAGAAAAGGGA 

AAGCCACGGAATGTGTGTGAAGCCCAGAAATGGCATTTGCAGTTAATTAGCACATGTGAGGG 

TTAGACAGGTAGGTGAATGCAAGCTCAAGGTTTGGAAAAATGACTTTTCAGTTATGTCTTTG 

GTATCAGACATACGAAAGGTCTCTTTGTAGTTCGTGTTAATGTAACATTAATAAATTTATTG 

ATTCCATTGCTTTAAAAAAAAAAAAAAA 
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FIGURE 64 

MWLPLVLLLAVLLLAVLCKVYLGLFSGSSPNPFSEDVKRPPAPLVTDKEARKKVLKQAFSAN 
QVPEKLDVWIGSGFGGLAAAAILAKAGKRVLVLEQHTKAGGCCHTFGKNGLEFDTGIHYIG 
RMEEGSIGRFILDQITEGQLDWAPLSSPFDIMVLEGPNGRKEYPMYSGEKAYIQGLKEKFPQ 
EEAIIDKYIKLVKWSSGAPHAILLKFLPLPWQLLDRCGLLTRFSPFLQASTQSLAEVLQQ 
LGASSELQAVLSYIFPTYGVTPMHSAFSMHALLVNHYMKGGFYPRGGSSEIAFHTIPVIQRA 
GGAVLTKATVQSVLLDSAGKACGVSVKKGHELVNIYCPIWSNAGLFNTYEHLLPGNARCLP 
GVKQQLGTVRPGLGMTSVF I CLRGTKEDLHLPSTNYYVYYDTDMDQAMERYVSMPREEAAEH 
IPLLFFAFPSAKDPTWEDRFPGRSTMIMLIPTAYEWFEEWQAELKGKRGSDYETFKNSFVEA 
SMSWLKLFPQLEGKVESVTAGSPLTNQFYLAAPRGACYGADHDLGRLHPCVMASLRAQSPI 
PNLYLTGQDI FTCGLVGALQGALLCSSAI LKRNLYSDLKNLDSR I RAQKKKN 
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FIGURE 65 

GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCTA 

GGGGTTGGCACCGGCCCCGAGAGGAGGATGCGGGTCCGGATAGGGCTGACGCTGCTGCTGTG 

TGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGGATGAAGAAGGCAGCCAGGATGAATCCT 

TAGATTCCAAGACTACTTTGACATCAGATGAGTCAGTAAAGGACCATACTACTGCAGGCAGA 

GTAGTTGCTGGTCAAATATTTCTTGATTCAGAAGAATCTGAATTAGAATCCTCTATTCAAGA 

AGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGTCACAGAAGATATCAGCTTTCTAG 

AGT C TC CAAAT C C AG AAAAC AAGGAC TATGAAGA G C C AAAG AAAGT ACGG AAAC C AG CTTTG 

ACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCTTTTCCTAGATAA 

GG AG TAT G AT GAAT GT AC AT CAGATGGG AG GGAAG AT GGCAGACTGTGGTGTGCT AC AACCT 

ATGACTACAAAGCAGATGAAAAGTGGGGCTTTTGTGAAACTGAAGAAGAGGCTGCTAAGAGA 

CGGCAGATGCAGGAAGCAGAAATGATGTATCAAACTGGAATGAAAATCCTTAATGGAAGCAA 

TAAGAAAAG C C AAAAAAG AGAAGCATATCGGT AT CT C CAAAAGG C AGC AAGC AT G AAC CAT A 

CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATC 

CAGG CAG CGAGAGAGATGT TTGAG AAG CTG A C TG AG G AAG GC T C T C C C AAGG G ACAG ACT G C 

TCTTGGCTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTAT 

ATTATACATTTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTT 

TAGTGGAAGGCTAATAATATTAACATCAGAAGAATTTGTGGTTTATAGCGGCCACAACTTTT 

TCAGCTTTCATGATCCAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAA 

ATTCTTGTTAATGGATATAACACATGGAATCTACATGTAAATGAAAGTTGGTGGAGTCCACA 

ATTTTTCTTTAAAATGATTAGTTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGC 

TCTTTTTAAATTTTCTCTGAGTTGGAATTGTCAGAATCATTTTTTACATTAGATTATCATAA 

TTTTAAAAATTTTTCTTTAGTTTTTCAAAATTTTGTAAATGGTGGCTATAGAAAAACAACAT 

GAAATATT£ TACAATATTTTGCAACAATGCCCTAAGAATTGTTAAAATTCATGGAGTTATl T 

GTGCAGAATGACTCCAGAGAGCTCTACTTTCTGTTTTTTACTTTTCATGATTGGCTGTCTTC 

CCATTTATTCTGGTCATTTATTGCTAGTGACACTGTGCCTGCTTCCAGTAGTCTCATTTTCC 

CT ATTTTG CTAATTTGT T ACTTTTT CTTTG CTAATTTGG AAG ATT AAC TC AT TTTTAAT AAA 

ATTATGTCTAAGATTAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 66 

MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLD 
SEESELESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHG 
EPCHFPFLFLDKEYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMM 
YQTGMKILNGSNKKSQKREAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEK 
LTEEGSPKGQTALGFLYASGLGVNSSQAKALVYYTFGALGGNL1AHMVLVSRL 
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FIGURE 67 

CTTCCCAGCCCTGTGCCCCAAAGCACCTGGAGCATATAGCCTTGCAGAACTTCTACTTGCCT 
GCCTCCCTGCCTCTGGCCATGGCCTGCCGGTGCCTCAGCTTCCTTCTGATGGGGACCTTCCT 
GTCAGTTTCCCAGACAGTCCTGGCCCAGCTGGATGCACTGCTGGTCTTCCCAGGCCAAGTGG 
CTCAACTCTCCTGCACGCTCAGCCCCCAGCACGTCACCATCAGGGACTACGGTGTGTCCTGG 
TACCAGCAGCGGGCAGGCAGTGCCCCTCGATATC^CCTCTACTACCGCTCGGAGGAGGATCA 
CCACCGGCCTGCTGACATCCCCGATCGATTCTCGGCAGCCAAGGATGAGGCCCACAATGCCT 
GTGTCCTCACCATTAGTCCCGTGCAGCCTGAAGACGACGCGGATTACTACTGCTCTGTTGGC 
TACGGCTTTAGTCCCTAGGGGTGGGGTGTGAGATGGGTGCCTCCCCTCTGCCTCCCATTTCT 
GCCCCTGACCTTGGGTCCCTTTTAAACTTTCTCTGAGCCTTGCTTCCCCTCTGTAAAATGGG 
TTAATAATATTCAACATGTCAACAAC 
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FIGURE 68 

MACRCLSFLLMGTFLSVSQTVLAQLDALLVFPGQVAQLSCTLSPQHVTIRDYGVSWYQQRAG 
SAP R YLL Y YRS E E DHHR PAD I PDRFS AAKDEAHNACVLT I S P VQ PEDDADYYCS VGYGFS P 
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GCCGCCCCGCCCCGAGACCGGGCCCGGGGGCGCGGGGCGGCGGGATGCGGCGCCCGGGGCGG 

CGATGACCGCGGAGCGCACGCCGCGGGCCCGGCCCTGACCCCGCCGCCCGCCCGCTGAGCCC 

CCCGCCGAGGTCCGGACAGGCCGAGATGACGCCGAGCCCCCTGTTGCTGCTCCTGCTGCCGC 

CGCTGCTGCTGGGGGCCTTCCCACCGGCCGCCGCCGCCCGAGGCCCCCCAAAGATGGCGGAC 

AAGGTGGTCCCACGGCAGGTGGCCCGGCTGGGCCGCACTGTGCGGCTGCAGTGCCCAGTGGA 

GGGGGACCCGCCGCCGCTGACCATGTGGACCAAGGATGGCCGCACCATCCACAGCGGCTGGA 

GCCGCTTCCGCGTGCTGCCGCAGGGGCTGAAGGTGAAGCAGGTGGAGCGGGAGGATGCCGGC 

GTGTACGTGTGCAAGGCCACCAACGGCTTCGGCAGCCTGAGCGTCAACTACACCCTCGTCGT 

GCTGGATGACATTAGCCCAGGGAAGGAGAGCCTGGGGCCCGACAGCTCCTCTGGGGGTCAAG 

AGGACCCCGCCAGCCAGCAGTGGGCACGACCGCGCTTCACACAGCCCTCCAAGATGAGGCGC 

CGGGTGATCGCACGGCCCGTGGGTAGCTCCGTGCGGCTCAAGTGCGTGGCCAGCGGGCACCC 

TCGGCCCGACATCACGTGGATGAAGGACGACCAGGCCTTGACGCGCCCAGAGGCCGCTGAGC 

CCAGGAAGAAGAAGTGGACACTGAGCCTGAAGAACCTGCGGCCGGAGGACAGCGGCAAATAC 

ACCTGCCGCGTGTCGAACCGCGCGGGCGCCATCAACGCCACCTACAAGGTGGATGTGATCCA 

GCGGACCCGTTCCAAGCCCGTGCTCACAGGCACGCACCCCGTGAACACGACGGTGGACTTCG 

GGGGGACCACGTCCTTCCAGTGCAAGGTGCGCAGCGACGTGAAGCCGGTGATCCAGTGGCTG 

AAGCGCGTGGAGTACGGCGCCGAGGGCCGCCACAACTCCACCATCGATGTGGGCGGCCAGAA 

GTTTGTGGTGCTGCCCACGGGTGACGTGTGGTCGCGGCCCGACGGCTCCTACCTCAATAAGC 

TGCTCATCACCCGTGCCCGCCAGGACGATGCGGGCATGTACATCTGCCTTGGCGCCAACACC 

ATGGGCTACAGCTTCCGCAGCGCCTTCCTCACCGTGCTGCCAGACCCAAAACCGCCAGGGCC 

ACCTGTGGCCTCCTCGTCCTCGGCCACTAGCCTGCCGTGGCGCGTGGTCATCGGCATCCCAG 

CCGGCGCTGTCTTCATCCTGGGCACCCTGCTCCTGTGGCTTTGCCAGGCCCAGAAGAAGCCG 

TGCACCCCCGCGCCTGCCCCTCCCCTGCCTGGGCACCGCCCGCCGGGGACGGCCCGCGACCG 

CAGCGGAGACAAGGACCTTCCCTCGTTGGCCGCCCTCAGCGCTGGCCCTGGTGTGGGGCTGT 

GTGAGGAGCATGGGTCTCCGGCAGCCCCCCAGCACTTACTGGGCCCAGGCCCAGTTGCTGGC 

CCTAAGTTGTACCCCAAACTCTACACAGACATCCACACACACACACACACACACTCTCACAC 

ACACTCACACGTGGAGGGCAAGGTCCACCAGCACATCCACTATCAGTGC TAG ACGGCACCGT 

ATCTGCAGTGGGCACGGGGGGGCCGGCCAGACAGGCAGACTGGGAGGATGGAGGACGGAGCT 

GCAGACGAAGGCAGGGGACCCATGGCGAGGAGGAATGGCCAGCACCCCAGGCAGTCTGTGTG 

TGAGGCATAGCCCCTGGACACACACACACAGACACACACACTACCTGGATGCATGTATGCAC 

ACACATGCGCGCACACGTGCTCCCTGAAGGCACACGTACGCACACGCACATGCACAGATATG 

CCGCCTGGGCACACAGATAAGGTGGCCAAATGCACGCACACGCACAGAGACATGCCAGAACA 

TACAAGGACATGCTGCCTGAACATACACACGCACACCCATGCGCAGATGTGCTGCCTGGACA 

CACACACACACACGGATATGCTGTCTGGACGCACACACGTGCAGATATGGTATCCGGACACA 

CACGTGCACAGATATGCTGCCTGGACACACAGATAATGCTGCCTTGACA JACACATGCACGG 

ATATTGCCTGGACACACAGACACACACACGCGTGCACAGATATGCTGTCTGGACACGCACAC 

ACATGCAGATATGCTGCCTGGACACACACTTCCAGACACACGTGCACAGGCGCAGATATGCT 

GCCTGGACACACGCAGATATGCTGTCTAGTCACACACACACGCAGACATGCTGTCCGGACAC 

ACACACGCATGCACAGATATGCTGTCCGGACACACACACGCACGCAGATATGCTGCCTGGAC 

ACACACACAGATAATGCTGCCTCAACACTCACACACGTGCAGATATTGCCTGGACACACACA 

TGTGCACAGATATGCTGTCTGGACATGCACACACGTGCAGATATGCTGTCCGGATACACACG 

CACGCACACATGCAGATATGCTGCCTGGGCACACACTTCCGGACACACATGCACACACAGGT 

GCAGATATGCTGCCTGGACACACACACAGATAATGCTGCCTCAACACTC^CACACGTGCAGA 

TATTGCCTGGACACACACATGTGCACAGATATGCTGTCTGGACATGCACACACGTGCAGATA 

TGCTGTCCGGATACACACGCACGCACACATGCAGATATGCTGCCTGGGCACACACTTCCGGA 

CACACATGCACACACAGGTGCAGATATGCTGCCTGGACACACGCAGACTGACGTGCTTTTGG 

GAGGGTGTGCCGTGAAGCCTGCAGTACGTGTGCCGTGAGGCTCATAGTTGATGAGGGACTTT 

CCCTGCTCCACCGTCACTCCCCCAACTCTGCCCGCCTCTGTCCCCGCCTCAGTCCCCGCCTC 

CATCCCCGCCTCTGTCCCCTGGCCTTGGCGGCTATTTTTGCCACCTGCCTTGGGTGCCCAGG 

AGTCCCCTACTGCTGTGGGCTGGGGTTGGGGGCACAGCAGCCCCAAGCCTGAGAGGCTGGAG 

CCCATGGCTAGTGGCTCATCCCCAGTGCATTCTCCCCGTGACACAGAGAAGGGGCCTTGGTA 

TTTATATTTAAGAAATGAAGATAATATTAATAATGATGGAAGGAAGACTGGGTTGCAGGGAC 

TGTGGTCTCTCCTGGGGCCCGGGACCCGCCTGGTCTTTCAGCCATGCTGATGACCACACCCC 

GTCCAGGCCAGACACCACCCCCCACCCCACTGTCGTGGTGGCCCCAGATCTCTGTAATTTTA 

TGTAGAGTTTGAGCTGAAGCCCCGTATATTTAATTTATTTTGTTAAACACAAAA 
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MTPSPLLLLLLPPLLLGAFPPAAAARGPPKR^DKVVPRQVARLGRTVRLQCPVEGDPPPLTM 
WTKDGRTIHSGWSRFRVLPQGLKVKQVEREDAGVYVCKATNGFGSLSVNYTLWLDDISPGK 
ESLGPDSSSGGQEDPASQQWARPRFTQPSKMRRRVIARPVGSSVRLKCVASGHPRPDITWMK 
DDQALTRPEAAEPRKKKWTLSLKNLRPEDSGKYTCRVSNRAGAINATYKVDVIQRTRSKPVL 
TGTHPVNTTVDFGGTTSFQCKVRSDVKPVIQWLKRVEYGAEGRHNSTIDVGGQKFWLPTGD 
VWSRPDGSYLNKLLITRARQDDAGMYICLGANTMGYSFRSAFLTVLPDPKPPGPPVASSSSA 
TSLPWPWIGIPAGAVFILGTLLLWLCQAQKKPCTPAPAPPLPGHRPPGTARDRSGDKDLPS 
LAALSAGPGVGLCEEHGSPAAPQHLLGPGPVAGPKLYPKLYTDIHTHTHTHSHTHSHVEGKV 
HQHIHYQC 
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FIGURE 71A 

CCCAGCTGAGGAGCCCTGCTCAAGACACGGTCACTGGATCTGAGAAACTTCCCAGGGGACCG 
CATTCCAGAGTCAGTGACTCTGTGAAGCACCCACATCTACCTCTTGCCACGTTCCCACGGGC 
TTGGGGGAAAGATGGTGGGGACCAAGGCCTGGGTGTTCTCCTTCCTGGTCCTGGAAGTCACA 
TCTGTGTTGGGGAGACAGACGATGCTCACCCAGTCAGTAAGAAGAGTCCAGCCTGGGAAGAA 
GAACCCCAGCATCTTTGCCAAGCCTGCCGACACCCTGGAGAGCCCTGGTGAGTGGACAACAT 
GGTTCAACATCGACTACCCAGGGGGGAAGGGCGACTATGAGCGGCTGGACGCCATTCGCTTC 
TACTATGGGGACCGTGTATGTGCCCGTCCCCTGCGGCTAGAGGCTCGGACCACTGACTGGAC 
ACCTGCGGGCAGCACTGGCCAGGTGGTCCATGGTAGTCCCCGTGAGGGTTTCTGGTGCCTCA 
ACAGGGAGCAGCGGCCTGGCCAGAACTGCTCTAATTACACCGTACGCTTCCTCTGCCCACCA 
GGATCCCTGCGCCGAGACACAGAGCGCATCTGGAGCCCATGGTCTCCCTGGAGCAAGTGCTC 
AGCTGCCTGTGGTCAGACTGGGGTCCAGACTCGCACACGCATTTGCTTGGCAGAGATGGTGT 
CGCTGTGCAGTGAGGCCAGCGAAGAGGGTCAGGACTGCATGGGCCAGGAGTGTAGAGCCTGT 
GACCTGACCTGCCCAATGGGCCAGGTGAATGCTGACTGTGATGCCTGCATGTGCCAGGACTT 
CATGCTTCATGGGGCTGTCTCCCTTCCCGGAGGTGCCCCAGCCTCAGGGGCTGCTATCTACC 
TCCTGACCAAGACGCCGAAGCTGCTGACCCAGACAGACAGTGATGGGAGATTCCGAATCCCT 
GGCTTGTGCCCTGATGGCAAAAGCATCCTGAAGATCACAAAGGTCAAGTTTGCCCCCATTGT 
ACTCACAATGCC CAAGACTAGCCTGAAGGCAG CC AC CAT CAAGGCAGAGTTTGTGAGGGCAG 
AGACTCCATACATGGTGATGAACCCTGAGACAAAAGCACGGAGAGCTGGGCAGAGCGTGTCT 
CTGTGCTGTAAGGCCACAGGGAAGCCCAGGCCAGACAAGTATTTTTGGTATCATAATGACAC 
ATTGCTGGATCCTTCCCTCTACAAGCATGAGAGCAAGCTGGTGCTGAGGAAACTGCAGCAGC 
ACCAGGCTGGGGAGTACTTTTGCAAGGCCCAGAGTGATGCTGGGGCTGTGAAGTCCAAGGTT 
GCCCAGCTGATTGTCACAGCATCTGATGAGACTCCTTGCAACCCAGTTCCTGAGAGCTATCT 
TATCCGGCTGCCCCATGATTGCTTTCAGAATGCCACCAACTCCTTCTACTATGACGTGGGAC 
GCTGCCCTGTTAAGACTTGTGCAGGGCAGCAGGATAATGGGATCAGGTGCCGTGATGCTGTG 
CAGAACTGCTGTGGCATCTCCAAGACAGAGGAAAGGGAGATCCAGTGCAGTGGCTACACGCT 
ACCCACCAAGGTGGCCAAGGAGTGCAGCTGCCAGCGGTGTACGGAAACTCGGAGCATCGTGC 
GGGGCCGTGTCAGTGGTGCTGACAATGGGGAGCCCATGCGCTTTGGCCATGTGTACATGGGG 
AACAGCCGTGTAAGCATGACTGGCTACAAGGGCACTTTCACCCTCCATGTCCCCCAGGACAC 
TGAGAGGCTGGTGCTCACATTTGTGGACAGGCTGCAGAAGTTTGTCAACACCACCAAAGTGC 
TACCTTTCAACAAGAAGGGGAGTGCCGTGTTCCATGAAATCAAGATGCTTCGTCGGAAAGAG 
CCCATCACTTTGGAAGCCATGGAGACCAACATCATCCCCCTGGGGGAAGTGGTTGGTGAAGA 
CCCCATGGCTGAACTGGAGATTCCATCCAGGAGTTTCTACAGGCAGAATGGGGAGCCCTACA 
TAGGAAAAGTGAAGGCCAGTGTGACCTTCCTGGATCCCCGGAATATTTCCACAGCCACAGCT 
GCCCAGACTGACCTGAACTTCATCAATGACGAAGGAGACACTTTCCCCCTTCGGACGTATGG 
CATGTTCTCTGTGGACTTCAGAGATGAGGTCACCTCAGAGCCACTTAATGCTGGCAAAGTGA 
AGGTCCACCTTGACTCGACCCAGGTCAAGATGCCAGAGCACATATCCACAGTGAAACTCTGG 
TCACTCAATCCAGACACAGGGCTGTGGGAGGAGGAAGGTGATTTCAAATTTGAAAATCAAAG 
GAGGAACAAAAGAGAAGACAGAACCTTCCTGGTGGGCAACCTGGAGATTCGTGAGAGGAGGC 
TCTTTAACCTGGATGTTCCTGAAAGCAGGCGGTGCTTTGTTAAGGTGAGGGCCTACCGGAGT 
GAGAGGTTCTTGCCTAGTGAGCAGATCCAGGGGGTTGTGATCTCCGTGATTAACCTGGAGCC 
TAGAACTGGCTTCTTGTCCAACCCTAGGGCCTGGGGCCGCTTTGACAGTGTCATCACAGGCC 
CCAACGGGGCCTGTGTGCCTGCCTTCTGTGATGACCAGTCCCCTGATGCCTACTCTGCCTAT 
GTCTTGGCAAGCCTGGCTGGGGAGCAACTGCAAGCAGTGGAGTCTTCTCCTAAATTCAACCC 
AAATGCAATTGGCGTCCCTCAGCCCTATCTCAACAAGCTCAACTACCGTCGGACGGACCATG 
AGGATCCACGGGTTAAAAAGACAGCTTTCCAGATTAGCATGGCCAAGCCAAGGCCCAACTCA 
GCTGAGGAGAGCAATGGGCCCATCTATGCCTTTGAGAACCTCCGGGCATGTGAAGAGGCACC 
AC GCAGTG C AGC C C ACTT C CGGTT CTAC C AG AT TG AGGG GG AT G GAT AT G AC TACAAC AC AG 
TCCCCTTCAACGAAGATGACCCTATGAGCTGGACTGAAGACTATCTGGCATGGTGGCCAAAG 
CCGATGGAATTCAGGGCCTGCTATATCAAGGTGAAGATTGTGGGGCCACTGGAAGTGAATGT 
G CG ATC CC GC AACATGGGGG G C AC TC AT CGGCGG AC AG TGGGGAAGCTGTATGGAATCC GAG 
ATGTGAGGAGCACTCGGGACAGGGACCAGCCCAATGTCTCAGCTGCCTGTCTGGAGTTCAAG 
TGCAGTGGGATGCTCTATGATCAGGACCGTGTGGACCGCACCCTGGTGAAGGTCATCCCCCA 
GGG C AG CT GC CGTC GAGCCAGTGTGAACCCCATGCTGC AT GAGT AC CTGGTCAACC ACT TGC 
CACTTGCAGTCAACAACGACACCAGTGAGTACACCATGCTGGCACCCTTGGACCCACTGGGC 
CACAACTATGGCATCTACACTGTCACTGACCAGGACCCTCGCACGGCCAAGGAGATCGCGCT 
CGGCCGGTGCTTTGATGGCACATCCGATGGCTCCTCCAGAATCATGAAGAGCAATGTGGGAG 
TAGCCCTCACCTTCAACTGTGTAGAGAGGCAAGTAGGCCGCCAGAGTGCCTTCCAGTACCTC 
CAAAGCACCCCAGCCCAGTCCCCTGCTGCAGGCACTGTCCAAGGAAGAGTGCCCTCGAGGAG 
GCAGCAGCGAGCGAGCAGGGGTGGCCAGCGCCAGGGTGGAGTGGTGGCCTCTCTGAGATTTC 
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CTAGAGTTGCTCAACAGCCCCTGATCAACTAAGTTTTGTGGTACTTCACCCTCTTCTGCCCT 
CATTTCATGTGACAGCCATTGTGAGACTGATGCACAAACTGTCACTTGGTTAATTTAAGCAC 
TTCTGTTTTCGTGAATTTGCTTGTTTGTTTCTTCATGCCTTTACTTACTTTGTGCCATGCTA 
CTGATTGGCACGTGGCCCCCACAATGGCACAATAAAGCCCCTTTGTGAAACTGTTCTTTAAA 
TGAAACACAAGAAATTGGCCACTGGTAAAACTCTGCAGCTTCAACTGTACTTCATTTAATGC 
CATTAATGCAAATATACTTCCTCTTCTTTTTGCATGGTTTTGCCCACCTCTGCAATAGTGAT 
AATCTGATGCTGAAGATCAAATAACCAATATAAAGCATATTTCTTGGCCTTGCTCCACAGGA 
CATAGGCAAGCCTTGATCATAGTTCATACATATAAATGGTGGTGAAATAAAGAAATAAAACA 
CAATACTTTTACTTGAAATGTAAATAACTTATTTATTTCTTTGCTAAATTTGGAATTCTAGT 
GCACATTCAAAGTTAAGCTATTAAATATAGGGTGAT CATAGTTCCTCTACCAAGTCTGGAAA 
GAACATCTCCTGGTATCCACAATTACACCAGGTTGCTAACTGTATTTGTACATTTCCCTTTG 
CATTCGCTTTTGTTCTTGCTAGAAACCCAGTGTAGCCCAGGGCAGATGTCAATAAATGCATA 
CTCTGTATTTCGAAAAAA 
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IWGTKAWVFSFLVLEVTSVLGRQTMLTQSVRRVQPGKKNPSIFAKPADTLESPGEWTTWFNI 
DYPGGKGDYERLDAIRFYYGDRVCARPLRLEARTTDWTPAGSTGQVVHGSPREGFWCLNREQ 
RPGQNCSNYTVRFLCP PGSLRRDTER I WS PWS PWS KCS AACGQTGVQTRTRI CLAEMVSLCS 
EASEEGQHCMGQDCTACDLTCPMGQVNADCDACMCQDFMLHGAVSLPGGAPASGAAIYLLTK 
TPKLLTQTDSDGRFRI PGLCPDGKS ILKI TKVKFAP I VLTMPKTSLKAATI KAEFVRAETPY 
IWMNPETKARRAGQSVSLCCKATGKPRPDKYFWYHNDTLLDPSLYKHESKLVLRKLQQHQAG 
EYFCKAQSDAGAVKSKVAQLIVTASDETPCNPVPESYLIRLPHDCFQNATNSFYYDVGRCPV 
KTCAGQQDNGIRCRDAVQNCCGISKTL^REIQCSGYTLPTKVAKECSCQRCTETRSIVRGRV 
SAADNGEPMRFGHVYMGNSRVSMTGYKGTFTLHVPQDTERLVLTFVDRLQKFVNTTKVLPFN 
KKGSAVFHE I KMLRRKE P I TLE AMETN 1 1 PLGEWGEDPMAELE I PSRSFYRQNGEPYIGKV 
KASVTFLDPRNISTATAAQTDLNFINDEGDTFPLRTYGMFSVDFRDEVTSEPL.NAGKVKVHL 
DSTQVKMPEHISTVKLWSLNPDTGLWEEEGDFKFENQRRNKREDRTFLVGNLEIRERRLFNL 
DVPESRRCFVKVRAYRSERFLPSEQIQGWISVINLEPRTGFLSNPRAWGRFDSVITGPNGA 
CVPAFCDDQSPDAYSAYVLASLAGEELQAVESSPKFNPNAIGVPQPYLNKLNYRRTDHEDPR 
VKKTAFQISMAKPRPNSAEESNGPIYAFENLRACEEAPPSAAHFRFYQIEGDRYDYNTVPFN 
EDDPMSWTEDYLAWWPKPMEFRACYIKVKIVGPLEVNVRSRNMGGTHRRTVGKLYGIRDVRS 
TRDRDQPNVSAACLEFKCSGMLYDQDRVDRTLVKVIPQGSCRRASVNPMLHEYLVNHLPLAV 
NNDTSEYTMLAPLDPLGHNYGIYTVTDQDPRTAKEIALGRCFDGTSDGSSRIMKSNVGVALT 
FNCVERQVGRQSAFQYLQSTPAQSPAAGTVQGRVPSRRQQRASRGGQRQGGWASLRFPRVA 
QQPLIN 
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CTGCAAGTTGTTAACGCCTAACACACAAGTATGTTAGGCTTCCACCAAAGTCCTCAATATAC 
CTGAATACGCACAATATCTTAACTCTTCATATTTGGTTTTGGGATCTGCTTTGAGGTCCCAT 
CTTCATTTAAAAAAAAATACAGAGACCTACCTACCCGTACGCATACATACATATGTGTATAT 
ATATGTAAACTAGACAAAGATCGCAGATCATAAAGCAAGCTCTGCTTTAGTTTCCAAGAAGA 
TTACAAAGAATTTAGAGAIGTATTTGTCAAGATCCCTGTCGATTCATGCCCTTTGGGTTACG 
GTGTCCTCAGTGATGCAGCCCTACCCTTTGGTTTGGGGACATTATGATTTGTGTAAGACTCA 
GATTTACACGGAAGAAGGGAAAGTTTGGGATTACATGGCCTGCCAGCCGGAATCCACGGACA 
TGACAAAATATCTGAAAGTGAAACTCGATCCTCCGGATATTACCTGTGGAGACCCTCCTGAG 
ACGTTCTGTGCAATGGGCAATCCCTACATGTGCAATAATGAGTGTGATGCGAGTACCCCTGA 
GCTGGCACACCCCCCTGAGCTGATGTTTGATTTTGAAGGAAGACATCCCTCCACATTTTGGC 
AGTCTGCCACTTGGAAGGAGTATCCCAAGCCTCTCCAGGTTAACATCACTCTGTCTTGGAGC 
AAAACCATTGAGCTAACAGACAACATAGTTATTACCTTTGAATCTGGGCGTCCAGACCAAAT 
GATC CTGG AG AAGT CT CT CG AT TATGGACGAACATGGCAGCCCTATCAGT ATTATG CCAC AG 
ACTGCTTAGATGCTTTTCACATGGATCCTAAATCCGTGAAGGATTTATCACAGCATACGGTC 
TTAGAAAT CATT TG CACAGAAGAGTACT CAAC AGGGTAT ACAAC AAAT AG CAAAAT AATC CA 
CTTTGAAATCAAAGACAGGTTCGCGCTTTTTGCTGGACCTCGCCTACGCAATATGGCTTCCC 
TCTACGGACAGCTGGATACAACCAAGAAACTCAGAGATTTCTTTACAGTCACAGACCTGAGG 
ATAAGGCTGTTAAGACCAGCCGTTGGGGAAATATTTGTAGATGAGCTACACTTGGCACGCTA 
CTTTTACGCGATCTCAGACATAAAGGTGCGAGGAAGGTGCAAGTGTAATCTCCATGCCACTG 
TATGTGTGTATGACAACAGCAAATTGACATGCGAATGTGAGCACAACACTACAGGTCCAGAC 
TGTGGGAAATGCAAGAAGAATTATCAGGGCCGACCTTGGAGTCCAGGCTCCTATCTCCCCAT 
CCCCAAAGGCACTGCAAATACCTGTATCCCCAGTATTTCCAGTATTGGTACGAATGTCTGCG 
ACAACGAGCTCCTGCACTGCCAGAACGGAGGGACGTGCCACAACAACGTGCGCTGCCTGTGC 
CCGGCCGCATACACGGGCATCCTCTGCGAGAAGCTGCGGTGCGAGGAGGCTGGCAGCTGCGG 
CTCCGACTCTGGCCAGGGCGCGCCCCCGCACGGCACCCCAGCGCTGCTGCTGCTGACCACGC 
TGCTGGGAACCGCCAGCCCCCTGGTGTTCTAGGTGTCACCTCCAGCCACACCGGACGGGCCT 
GTGCCGTGGGGAAGCAGACACAACCCAAACATTTGCTACTAACATAGGAAACACACACATAC 
AGACACCCCCACTCAGACAGTGTACAAACTAAGAAGGCCTAACTGAACTAAGCCATATTTAT 
CACCCGTGGACAGCACATCCGAGTCAAGACTGTTAATTTCTGACTCCAGAGGAGTTGGCAGC 
TGTTGATATTAT CACTGCAAATCACATTGCCAGCTGCAGAGCATATTGTGGATTGGAAAGGC 
TGCGACAGCCCCCCAAACAGGAAAGACAAAAAACAAACAAATCAACCGACCTAAAAACATTG 
GCTACTCTAGCGTGGTGCGCCCTAGTACGACTCCGCCCAGTGTGTGGACCAACCAAATAGCA 
TTCTTTGCTGTCAGGTGCATTGTGGGCATAAGGAAATCTGTTACAAGCTGCCATATTGGCCT 
GCTTCCGTCCCTGAATCCCTTCCAACCTGTGCTTTAGTGAACGTTGCTCTGTAACCCTCGTT 
GGTTGAAAGATTTCTTTGTCTGATGTTAGTGATGCACATGTGTAACAGCCCCCTCTAAAAGC 
GCAAGCCAGTCATACCCCTGTATATCTTAGCAGCACTGAGTCCAGTGCGAGCACACACCCAC 
TATACAAGAGTGGCTATAGGAAAAAAGAAAGTGTATCTATCCTTTTGTATTCAAATGAAGTT 
ATTTTTCTTGAACTACTGTAATATGTAGATTTTTTGTATTATTGCCAATTTGTGTTACCAGA 
CAATCTGTTAATGTATCTAATTCGAATCAGCAAAGACTGACATTTTATTTTGTCCTCTTTCG 
TTCTGTTTTGTTTCACTGTGCAGAGATTTCTCTGTAAGGGCAACGAACGTGCTGGCATCAAA 
GAATATCAGTTT AC AT AT AT AACAAGTG TAAT AAGATT C C AC CAAAGGACATT CTAAATGTT 
TTCTTGTTGCTTTAACACTGGAAGATTTAAAGAATAAAAACTCCTGCATAAACGATTTCAGG 
AATTTGTATTGCAATTTCTTAAGATGAAAGGAACAGCCACCAAGCAGTTTCACACTCACTTT 
ACTGATTTCTGTGTGGACTGAGTACATTCAGCTGACGAATTTAGTTCCCAGGAAGATGGATT 
GATGTTGACTAGCTTGGAGAACTTCTGCAAAATATGAGACTATTTCCACTTGGGAAAAATTA 
CAACAGCAAAAAAAAAAAAAAAAAAAAAA 
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MYLSRSLS IHALWVTVSSVMQPYPLVWGHYDLCKTQ I YTEEGKVWDYMACQPESTDMTKYLK 
VKLDPPDITCGDPPETFCAMGNPYMCNNECDASTPELAHPPELMFDFEGRHPSTFWQSATWK 
EYPKPLQVNITLSWSKTIELTDNIVITFESGRPDQMILEKSLDYGRTWQPYQYYATDCLDAF 
HMDPKSVKDLSQHTVLEI ICTEEYSTGYTTNSKI IHFEIKDRFALFAGPRLRNMASLYGQLD 
TTKKLRDFFTVTDLRIRLLRPAVGEIFVDELHLARYFYAISDIKVRGRCKCNLHATVCVYDN 
SKLTCECEHNTTGPDCGKCKKNYQGRPWSPGSYLP1PKGTANTCIPSISSIGTNVCDNELLH 
CQNGGTCHNNVRCLCPAAYTGILCEKLRCEEAGSCGSDSGQGAPPHGTPALLLLTTLLGTAS 
PLVF 
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FIGURE 75 

CCCACGCGTCCGGGTGACCTGGGCCGAGCCCTCCCGGTCGGCTAAGATTGCTGAGGAGGCGG 

CGGGTAGCTGGCAGGCGCCGACTTCCGAAGGCCGCCGTCCGGGCGAGGTGTCCTCATGACTT 

CTCTTGTGGACCATGTCCGTGATCTTTTTTGCCTGCGTGGTACGGGTAAGGGATGGACTGCC 

CCTCTCAGCCTCTACTGATTTTTACCACACCCAAGATTTTTTGGAATGGAGGAGACGGCTCA 

AGAGTTTAGCCTTGCGACTGGCCCAGTATCCAGGTCGAGGTTCTGCAGAAGGTTGTGACTTT 

AGTATACATTTTTCTTCTTTCGGGGACGTGGCCTGCATGGCTATCTGCTCCTGCCAGTGTCC 

AGCAGCCATGGCCTTCTGCTTCCTGGAGACCCTGTGGTGGGAATTCACAGCTTCCTATGACA 

CTACCTGCATTGGCCTAGCCTCCAGGCCATACGCTTTTCTTGAGTTTGACAGCATCATTCAG 

AAAGTGAAGTGGCATTTTAACTATGTAAGTTCCTCTCAGATGGAGTGCAGCTTGGAAAAAAT 

TCAGGAGGAGCTCAAGTTGCAGCCTCCAGCGGTTCTCACTCTGGAGGACACAGATGTGGCAA 

ATGGGGTGATGAATGGTCACauaCCGATGCACTTGGAGCCTGCTCCTAATTTCCGAATGGAA 

CCAGTGACAGCCCTGGGTATCCTCTCCCTCATTCTCAACATCATGTGTGCTGCCCTGAATCT 

CATTCGAGGAGTTCACCTTGCAGAACATTCTTTACAGGATCCAAGGAGCTGGTTCTGCTGGT 

TGGACCAAACCTCGTGAGCCAGCCACCCCTGACCCAAATGAGGAGAGCTCTGATTCTCCCAT 

CCGGGAGCAGTGATGTCAAACTTCTGCTGCTGGGGAAATCTCATCAGCAGGGAGCCTGTGGA 

AAAGGGCATGTCAGTGAAATCTGGGAATGGCTGGATTCGGAAACATCTGCCCATGTGTATTG 

ATGGCAGAGCTGTTGCCCACAAGCGCCTTTTATTTAGGGT7VAAATTAACAAATCCATTCTAT 

TCCTCTGACCCATGCTTAGTACATATGACCTTTAACCCTTACATTTATATGATTCTGGGGTT 

GCTTCAGAAGTGTTATTTCATGAATCATTCATATGATTTGATCCCCCAGGATTCTATTTTGT 

TTAATGGGCTTTTCTACTAAAAGCATAAAATACTGAGGCTGATTTAGTCAGGGCAAAACCAT 

TTACTTTACATATTCGTTTTCAATACTTGCTGTTCATGTTACACAAGCTTCTTACGGTTTTC 

TTGTAACAATAAATATTTTGAGTAAATAATGGGTACATTTTAACAAAC x CAGTAGTACAACC 

TAAACTTGTATAAAAGTGTGTAAAAATGTATAGCCATTTATATCCTATGTATAAATTAAATG 

AGGTGGCTTCAGAAATGGCAGAATAAATCTAAAGTGTT^ 

AAAAG 
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MSVI FFACWRVRDGLPLiSASTDFyHTQDFLEWRRRLKSLALRLiAQy PGRGSAEGCDFS I HF 
SSFGDVACMAI CSCQCPAAMAFCFLETLWWEFTAS YDTTC I GLASRPYAFLE FDS I IQKVKW 
HFISrn/SSSQMECSLEKIQEELKLQPPAVLTLEDTDVANGVMNGHTPMHLEPAPNFRMEPVTA 
LG I LSL I LNI MCAALNL IRGVHLAEHSLQDPRSWFCWLDQTS 
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TGCTTCCTGGAGACCCTGTGGTGGGAATTCACAGCTTCNTATGACACTACCTGCATTGGCNT 
AGCCTCCAGGCCATACGCTTTTCTTGAGTTTGACAGCATCATTCAGAAAGTGAAGTGGCATT 
TT AACT ATGT AAGT T C CTNT C AG A T Z GAGTGC AG CT TGGAAAAAATTC AGGAGGAGCT CAAG 
TTGCAGCCTCCAGCGGTTCTCANTATGGAGGACACAGATGTGGCAAATGGGGT 
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CTCAGCGGCGCTTCCTCGTAGCGAGCCTAGTGGCGGGTGTTTGCATTGAAACGTGAGCGCGA 
CCCGACCTTAAAGAGTGGGGAGCAAAGGGAGGACAGAGCCCTTTAAAACGAGGCGGGTGGTG 
CCTGCCCCTTTAAGGGCGGGGCGT CCGG ACGACTGTA.TCTGAGCCCCAGACTGCCCCGAGTT 
TCTGTCGCAGGCTGCGAGGAAAGGCCCCTAGGCTGGGTCTGGGTGCTTGGCGGCGGCGGCTT 
CCTCCCCGCTCGTCCTCCCCGGGCCCAGAGGCACCTCGGCTTCAGTCATGCTGAGCAGAGTA 
TGGAAGCACCTGACTACGAAGTGCTATCCGTGCGAGAACAGCTATTCCACGAGAGGATCCGC 
GAGTGTATTATATCAACACTTCTGTTTGCAACACTGTACATCCTCTGCCACATCTTCCTGAC 
CCGCTTCAAGAAGCCTGCTGAGTTCACCACAGTGGATGATGAAGATGCCACCGTCAACAAGA 
TTGCGCTCGAGCTGTGCACCTTTACCCTGGCAATTGCCCTGGGTGCTGTCCTGCTCCTGCCC 
TTCTCCATCATCAGCAATGAGGTGCTGCTCTCCCTGCCTCGGAACTACTACATCCAGTGGCT 
CAACGGCTCCCTCATCCATGGCCTCTGGAACCTTGTTTTTCTCTTCCCCAACCTGTCCCTCA 
TCTTCCTCATGCCCTTTGCATATTTCTTCACTGAGTCTGAGGGCTTTGCTGGCTCCAGAAAG 
GGTGTCCTGGGCCGGGTCTATGAGACAGTGGTGATGTTGATGCTCCTCACTCTGCTGGTGCT 
AGGTATGGTGTGGGTGGCATCAGCCATTGTGGACAAG7y\CAAGGCCAACAGAGAGTCACTCT 
ATGACTTTTGGGAGTACTATCTCCCCTACCTCTACTCATGCATCTCCTTCCTTGGGGTTCTG 
CTGCTCCTGGTGTGTACTCCACTGGGTCTCGCCCGCATGTTCTCCGTCACTGGGAAGCTGCT 
AGTCAAGCCCCGGCTGCTGGAAGACCTGGAGGAGCAGCTGTACTGCTCAGCCTTTGAGGAGG 
CAGCCCTGACCCGCAGGATCTGTAATCCTACTTCCTGCTGGCTGCCTTTAGACATGGAGCTG 
CTACACAGACAGGTCCTGGCTCTGCAGACACAGAGGGTCCTGCTGGAGAAGAGGCGGAAGGC 
TTCAGCCTGGCAACGGAACCTGGGCTACCCCCTGGCTATGCTGTGCTTGCTGGTGCTGACGG 
GCCTGTCTGTGCTCATTGTGGCCATCCACATCCTGGAGCTGCTCATCGATGAGGCTGCCATG 
CCCCGAGGCATGCAGGGTACCTCCTTAGGCCAGGTCTCCTTCTCCAAGCTGGGCTCCTTTGG 
TGCCGTCATTCAGGTTGTACTCATCTTTTACCTAATGGTGTCCTCAGTTGTGGGCTTCTATA 
GCTCTCCACTCTTCCGGAGCCTGCGGCCCAGATGGCACGACACTGCCATGACGCAGATAATT 
GGGAACTGTGTCTGTCTCCTGGTCCTAAGCTCAGCACTTCCTGTCTTCTCTCGAACCCTGGG 
GCTCACTCGCTTTGACCTGCTGGGTGACTTTGGACGCTTCAACTGGCTGGGCAATTTCTACA 
TTGTGTTCCTCTACAACGCAGCCTTTGCAGGCCTCACCACACTCTGTCTGGTGAAGACCTTC 
ACTGCAGCTGTGCGGGCAGAGCTGATCCGGGCCTTTGGGCTGGACAGACTGCCGCTGCCCGT 
CTCCGGTTTCCCCCAGGCATCTAGGAAGACCCAGCACCAGTGACCTCCAGCTGGGGGTGGGA 
AGGAAAAAACTGGACACTGCCATCTGCTGCCTAGGCCTGGAGGGAAGCCCAAGGCTACTTGG 
ACCTCAGGACCTGGAATCTGAGAGGGTGGGTGGCAGAGGGGAGCAGAGCCATCTGCACTATT 
GCATAATCTGAGCCAGAGTTTGGGACCAGGACCTCCTGCTTTTCCATACTTAACTGTGGCCT 
CAGCATGGGGTAGGGCTGGGTGACTGGGTCTAGCCCCTGATCCCAAATCTGTTTACACATCA 
ATCTGCCTCACTGCTGTTCTGGGCCATCCCCATAGCCATGTTTACATGATTTGATGTGCAAT 
AGGGTGGGGTAGwGGCAGGGAAAGGACTGGGCCAGGGCAGGCTCGGGAGATAGATTGTGTCC 
CTTGCCTGTGGCCCAGCAGAGCCTAAGCACTGTGCTATCCTGGAGGGGCTTTGGACCACCTG 
AAAGACCAAGGGGATAGGGAGGAGGAGGCTTCAGCCATCAGCAATAAAGTTGATCCCAGGGA 
AAAAAA 
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FIGURE 79 

MEAPDYEVLSVREQLFHERIRECIISTLLFATLYILCHIFLTRFKKPAEFTTVDDEDATVNK 
IALELCTFTLAIALGAVLLLPFSIISNEVLLSLPRNYYIQWLNGSLIHGLWNLVFLFPNLSL 
IFLMPFAYFFTESEGFAGSRKGVLGRVYETWMLMLLTL.LVLGMVWVASAIVDKNKANRESL 
YDFWEYYLPYLYSCISFLGVLLLLVCTPLGLARMFSVTGKLLVKPRLLEDL.EEQLYCSAFEE 
AALTRRICNPTSCWLPLDMELLHRQVLALQTQRVLLEKRRKASAWQRNLGYPLAMLCLLVLT 
GLSVLI VAIHILELLIDEAAMPRGMQGTSLGQVS FSKLGSFGAVIQVVL I FYLMVSSWGFY 
SSPLFRSLRPRWHDTAMTQI IGNCVCLLVLSSALPVFSRTLGLTRFDLLGDFGRFNWLGNFY 
IVFLYNAAFAGLTTLCLVKTFTAAVRAELIRAFGLDRLPLPVSGFPQASRKTQHQ 
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FIGURE 80 

GGCTGCCGAGGGAAGGCCCCTTGGGTTGGTCTTGGTTGCTTGGCGGCGGCGGNTTCNTCCCC 
GCTCGTCCTCCCCGGGCCCAGAGGCACCTCGGCTTCAGTCATGCTGAGCAGAGTATGGAA.GC 
ACCTGACTACGAAGTGCTATCCGTGCGAGAACAGCTATTCCACGAGAGGATCCGCGAGTGTA 
TTATATCAACACTTCTGTTTGCAACACTGTACATCCTCTGCCACATCTTCCTGACCCGCTTC 
AAGAAGCCTGCTGAGTTCACCACAGTGGATGATGAAGATGCCACCG 
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FIGURE 81 

GACCGACCTTAAAGAGTGGGAGCAAAGGGAGGACAGAGCCTTTTAAAACGAGGCGGTGGTGC 
CTGCCCTTTAAGGGCGGGGCGTCCGGACGACTGTATCTGAGCCCCAGACTGCCCCGAGTTTC 
TGTCGCAGGCTGCGAGGAAAGGCCCCTAGGCTGGGTCTGGTGCTTGGCGGCGGCGGCTTCCT 
CCCCGTTGTCNTCCCCGGGCCCAGAGGCACCTCGGCTTCAGTCATGCTGAGCAGAGTATGGA 
AGCACCTGACTACGAAGTGCTATCCGTGCGAGAACAGCTATTCCACGAGAGGATCCGCGAGT 
GTATTATATCAACACTTCTGTTTGCAACACTGTACATCNTCTGCCACATCTTCCTGACCCGC 
TTCAAGAAGCCTGCTGAGTTCACCACAGTGGATGATGAAGATGCCACCGTCAACAAGATTGC 
GCTCGAGCTGTGCACCTTTACCCTGGCAATTGCCCTGGGTGCTGTCCTGCTCCTGCCCTTCT 
CCATCATCAGCAATGAGGTGCTGCACTCCC 
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FIGURE 82 

GATGTGCTCCTTGGAGCTGGTGTGCAGTGTCGTGACTGTAAGATCAAGTCCAAACCTGTTTT 
GGAATTGAGGAAACTTCTCTTTTGATCTCAGCCCTTGGTGGTCCAGGTCTTCATGCTGCTGT 
GGGTGATATTACTGGTCCTGGCTr : C7GTCAGTGGACAGTTTGCAAGGACACCCAGGCCCATT 
ATTTTCCTCCAGCCTCCATGGACCACAGTCTTCCAAGGAGAGAGAGTGACCCTCACTTGCAA 
g GGGATTTCGCTTCTACTCACCACAGAAAACAAAATGGTACCATCGGTACCTTGGGAAAGAAA 
$ TACTAAGAGAAACCCCAGACAATATCCTTGAGGTTCAGGAATCTGGAGAGTACAGATGCCAG 
GCCCAGGGCTCCCCTCTCAGTAGCCCTGTGCACTTGGATTTTTCTTCAGAGATGGGATTTCC 
' : TCATGCTGCCCAGGCTAATGTTGAA TCCTGGGCTCAAGTGATCTGCTCACCTAGGCCTCTC 

AAAGCGCTGGGATTACAGCTTCGCTGATCCTGCAAGCTCCACTTTCTGTGTTTGAAGGAGAC 
TCTGTGGTTCTGAGGTGCCGGGCAAAGGCGGAAGTAAGACTGAATAATACTATTTACAAGAA 
TGATAATGTCCTGGCATTCCTTAATAAAAGAACTGACTTCCAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 83 

MLLWVILLVLAPVSGQFARTPRPIIFLQPPWTTVFQGERVTLTCKGFRFYSPQKTKWYHRYL 
GKEILRETPDNILEVQESGEYRCQAQGSPLSSPVHLDFSSEMGFPHAAQANVELLGSSDLLT 
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FIGURE 84 

CAGAAGAGGGGGCTAGCTAGCTGTCTCTGCGGACCAGGGAGACCCCCGCGCCCCCCCGGTGT 

GAGGCGGCCTCACAGGGCCGGGTGGGCTGGCGAGCCGACGCGGCGGCGGAGGAGGCTGTGAG 

GAGTGTGTGGAACAGGACCCGGGACAGAGGAACCATGGCTCCGCAGAACCTGAGCACCTTTT 

GCCTGTTGCTGCTATACCTCATCGGGGCGGTGATTGCCGGACGAGATTTCTATAAGATCTTG 

GGGGTGCCTCGAAGTGCCTCTATAAAGGATATTAAAAAGGCCTATAGGAAACTAGCCCTGCA 

GCTTCATCCCGACCGGAACCCTGATGATCCACAAGCCCAGGAGAAATTCCAGGATCTGGGTG 

CTGCTTATGAGGTTCTGTCAGATAGTGAGAAACGGAAACAGTACGATACTTATGGTGAAGAA 

GGATTAAAAGATGGTCATCAGAGCTCCCATGGAGACATTTTTTCACACTTCTTTGGGGATTT 

TGGTTTCATGTTTGGAGGAACCCCTCGTCAGCAAGACAGAAATATTCCAAGAGGAAGTGATA 

TTATTGTAGATCTAGAAGTCACTTTGGAAGAAGTATATGCAGGAAATTTTGTGGAAGTAGTT 

AGAAACAAACCTGTGGCAAGGCAGGCTCCTGGCAAACGGAAGTGCAATTGTCGGCAAGAGAT 

GCGGACCACCCAGCTGGGCCCTGGGCGCTTCCAAATGACCCAGGAGGTGGTCTGCGACGAAT 

GCCCTAATGTCAAACTAGTGAATGAAGAACGAACGCTGGAAGTAGAAATAGAGCCTGGGGTG 

AGAGACGGCATGGAGTACCCCTTTATTGGAGAAGGTGAGCCTCACGTGGATGGGGAGCCTGG 

AGATTTACGGTTCCGAATCAAAGTTGTCAAGCACCCAATATTTGAAAGGAGAGGAGATGATT 

TGTACACAAATGTGACAATCTCATTAGTTGAGTCACTGGTTGGCTTTGAGATGGATATTACT 

CACTTGGATGGTCACAAGGTACATATTTCCCGGGATAAGATCACCAGGCCAGGAGCGAAGCT 

ATGGAAGAAAGGGGAAGGGCTCCCCAACTTTGACAACAACAATATCAAGGGCTCTTTGATAA 

TCACTTTTGATGTGGATTTTCCAAAAGAACAGTTAACAGAGGAAGCGAGAGAAGGTATCAAA 

CAGCTACTGAAACAAGGGTCAGTGCAGAAGGTATACAATGGACTGCAAGGATATTGAGAGTG 

AATAAAATTGGACTTTGTTTAAAATAAGTGAATAAGCGATATTTATTATCTGCAAGGTTTTT 

TTGTGTGTGTTTTTGTTTTTATTTTCAATATGCAAGTTAGGCTTAATTTTTTTATCTAATGA 

TCATCATGAAATGAATAAGAGGGCTTAAGAATTTGTCCATTTGCATTCGGAAAAGAATGACC 

AGCAAAAGGTTTACTAATACCTCTCCCTTTGGGGATTTAATGTCTGGTGCTGCCGCCTGAGT 

TTCAAGAATTAAAGCTGCAAGAGGACTCCAGGAGCAAAAGAAACACAATATAGAGGGTTGGA 

GTTGTTAGCAATTTCATTCAAAATGCCAACTGGAGAAGTCTGTTTTTAAATACATTTTGTTG 

TTATTTTTA 
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FIGURE 85 

MAPQNLSTFCLLLLYLIGAVIAGRDFYKILGVPRSASIKDIKKAYRKLALQLHPDRNPDDPQ 
AQEKFQDLGAAYEVLSDSEKRKQYDTYGEEGLKDGHQSSHGDIFSHFFGDFGFMFGGTPRQQ 
DRNIPRGSDIIVDLEVTLEEVYAGNFVEWRNKPVARQAPGKRKCNCRQEMRTTQLGPGRFQ 
MTQEWCDECPNVKLVNEERTLEVEIEPGVRDGMEYPFIGEGEPHVDGEPGDLRFRIKWKH 
PIFERRGDDLYTNVTISLVESLVGFEMDITHLDGHKVHISRDKITRPGAKLWKKGEGLPNFD 
3STNNIKGSLIITFDVDFPKEQLTEEAREGIKQLLKQGSVQKVYNGLQGY 
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FIGURE 86 

TGGGACCAGGGAACCCCGGGCCCCCCGGTGGAGNGCCTAACAGGCCGGTGGNTGCGACCGAA 
GCGGCGGGCGGAGGAGGTTTTGAGGATTTTTGGAACAGGACCCGGACAGAGGAACCATGGTT 
CCGCAGAACNTGAGCACNTTTTGCCTGTTGNTGNTATACTTCATCGGGGCGGTGATTGCCGG 
ACGAGATTTNTATAAGATTTTGGGGTGCCTNGAAGTGCCTTNTATAAAGGATATTAAAAAGG 
CCTATAGGAAACTAGCCCTGCAGNTTTATCCCGACCGGAACCCTGATGATCCACAAGCCCAG 
GAGAAATTCCAGGATTTGGGTGCTGCTTATGAGGTTNTGTCAGATAGTGAGAAACGGAAACA 
GTACGATAATTATGGTGAAGAAGGATTAAAAGATGGTNATCAGAGCTCCCATGGAGACATTT 
TTTCACACTTNTTTGGGGATTTTGGTTTCATGTTTGGAGGAACCCCTNGTCAGCAAGACAGA 
AATATTCCAAGAG 
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FIGURE 87 

GGCACGAGGCGGCGGGGCAGTCGCGGGATGCGCCCGGGAGCCACAGCCTGAGGCCCTCAGGT 
CTCTGCAGGTGTCGTGGAGGAACCTAGCACCTGCCATCCTCTTCCCCAATTTGCCACTTCCA 
GCAGCTTTAGCCCATGAGGAGGATGTGACCGGGACTGAGTCAGGAGCCCTCTGGAAGCATGG 
AGACTGTGGTGATTGTTGCCATAGGTGTGCTGGCCACCATCTTTCTGGCTTCGTTTGCAGCC 
TTGGTGCTGGTTTGCAGGCAGCGCTACTGCCGGCCGCGAGACCTGCTGCAGCGCTATGATTC 
TAAGCCCATTGTGGACCTCATTGGTGCCATGGAGACCCAGTCTGAGCCCTCTGAGTTAGAAC 
TGGACGATGTCGTTATCACCAACCCCCACATTGAGGCCATTCTGGAGAATGAAGACTGGATC 
GAAGATGCCTCGGGTCTCATGTCCCACTGCATTGCCATCTTGAAGATTTGTCACACTCTGAC 
AGAGAAGCTTGTTGCCATGACAATGGGCTCTGGGGCCAAGATGAAGACTTCAGCCAGTGTCA 
GCGACATCATTGTGGTGGCCAAGCGGATCAGCCCCAGGGTGGATGATGTTGTGAAGTCGATG 
TACCCTCCGTTGGACCCCAAACTCCTGGACGCACGGACGACTGCCCTGCTCCTGTCTGTCAG 
T C AC CTGG TG CTGG TG ACAAGGAATG C CTGC CAT CTG A CGGGAGGCCTGG AC TGGATTGACC 
AGTCTCTGTCGGCTGCTGAGGAGCATTTGGAAGTCCTTCGAGAAGCAGCCCTAGCTTCTGAG 
CCAGATAAAGGCCTCCCAGGCCCTGAAGGCTTCCTGCAGGAGCAGTCTGCAATTTAGTGCCT 
ACAGGCCAGCAGCTAGCCATGAAGGCCCCTGCCGCCATCCCTGGATGGCTCAGCTTAGCCTT 
CTACTTTTTCCTATAGAGTTAGTTGTTCTCCACGGCTGGAGAGTTCAGCTGTGTGTGCATAG 
TAAAGCAGGAGATCCCCGTCAGTTTATGCCTCTTTTGCAGTTGCAAACTGTGGCTGGTGAGT 
GGCAGTCTAATACTACAGTTAGGGGAGATGCCATTCACTCTCTGCAAGAGGAGTATTGAAAA 
CTGGTGGACTGTCAGCTTTATTTAGCTCACCTAGTGTTTTCAAGAAAATTGAGCCACCGTCT 
AAGAAATCAAGAGGTTTCACATTAAAATTAGT^ATTTCTGGCCTCTCTCGATCGGTCAGAATG 
TGTGGCAATTCTGATCTGCATTTTCAGAAGAGGACAATCAATTGAAACTAAGTAGGGGTTTC 
TTCTTTTGGCAA.GACTTGTACTCTCTCACCTGGCCTGTTTCATTTATTTGTATTATCTGCCT 
GGTCCCTGAGGCGTCTGGGTCTCTCCTCTCCCTTGCAGGTTTGGGTTTGAAGCTGAGGAACT 
ACAAAGTTGATG A.TTTCTTTTTTATCTTTATGCCTGCAATTTTACCTAGCTACCACTAGGTG 
GATAGTAAATTTATACTTATGTTTCCCTCAAAAAAAAAAAAAAA 
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FIGURE 88 

METVVIVAIGVIjATIFLASFAAIjVLVCRQRYCRPRDLLQRYDSKPIVDLIGAMETQSEPSEL 
ELDDW I TNPH I EAI LENEDWI EDASGLMSHC I A I LK I CHTLTE KL VAMTMGSGAKMKTS AS 
VSD 1 1 WAKR I S PRVDD WKSMY P PLDP KLLDARTTALLLS VSHLVLVTRNACHLTGGLDW I 
DQSLSAAEEHLEVLREAALASEPDKGLPGPEGFLQEQSAI 

$ 
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FIGURE 89 

GCTTCATTTCTCCCGACTCAGCTTCCCACCCTGGGCTTTCCGAGGTGCTTTCGCCGCTGTCC 
CCACCACTGCAGCCA^ATCTCCTTAACGGACACGCAGAAAATTGGAATGGGATTAACAGGA 
TTTGGAGTGTTTTTCCTGTTCTTTGGAATGATTCTCTTTTTTGACAAAGCACTACTGGCTAT 

- TGGAAATGTTTTATTTGTAGCCGGCTTGGCTTTTGTAATTGGTTTAGAAAGAACATTCAGAT 
TCTTCTTCCAAAAACATAAAATGAAAGCTACAGGTTTTTTTCTGGGTGGTGTATTTGTAGTC 
CTTATTGGTTGGCCTTTGATAGGCATGATCTTCGAAATTTATGGATTTTTTGTCTTGTTCAG 
GGGCTTCTTTCCTGTCGTTGTTGGCTTTATTAGAAGAGTGCCAGTCCTTGGATCCCTCCTAAAT 
TTACCTGGAATTAGATCATTTGTAGATAAAGTTGGAGAAAGCAACAATATGGTATAACAACA 
AGTC . ^ATTTGAAGACTCATTTAAAATATTGTGTTATTTA'VAAAGTCATTTGAAGAATATTCA 
GCACAAAATTAAATTACATGAAATAGCTTGTAATGTTCTTTACAGGAGTTTAAAACGTATAG 
CCTACAAAGTACCAGCAGCAAATTAGCAAAGAAGCAGTGAAAACAGGCTTCTACTCAAGTGA 
ACTAAGAAGAAG T CAGCAAG CAAA CTGAG AGAGG TG AAAT CC ATGTT AATGATG CTTAAGAA 
ACTCTTGAAGGCTATTTGTGTTGTTTTTCCACAATGTGCGAAACTCAGCCATCCTTAGAGAA 
CTGTGGTGCCTGTTTCTTTTCTTTTTATTTTGAAGGCTCAGGAGCATCCATAGGCATTTGCT 

% TTTTAGAAGTGTCCACTGCAATGGCAAAAATATTTCCAGTTGCACTGTATCTCTGGAAGTGA 
TGCATGAATTCGATTGGATTGTGTCATTTTAAAGTATTAAAACCAAGGAAACCCCAATTTTG 
ATGTATGGATTACTTTTTTTTGNGCNCAGGGCC 
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FIGURE 90 

MISLTDTQKIGMGLTGFGVFFLFFGMILFFDKALLAIGNVLFVAGLAFVIGLERTFRFFFQK 
HKMKATGFFLGGVFWLIGWPLIGMIFEIYGFFLLFRGFFPVWGFIRRVPVLGSLLNLPGI 
RSFVDKVGESNNMV 

Important features: 
Transmembrane domains : 

amino acids 12-30 (typell) , 33-52, 69-89 and 93-109 

N-myristoylation sites. 

amino acids 11-16, 51-56 and 116-121 

Aminoacyl- transfer RNA synthetases class -II protein, 

amino acids 49-5 9 



BNSDOC1D: <W 0__, 996308 8 A2 .1 > 



WO 99/63088 ^ t . ^ PCT/US99/12252 



FIGURE 91 



GAAGACGTGGCGGCTCTCGCCTGGGCTGTTTCCCGGCTTCATTTCTCCCGACTCAGCTTCCC 
ACCNTGGGCTTTCCGAGGTGCTTTCGCCGCTGTCCCCACCACTGCAGCCATGATCTCCTTAA 
CGGACACGCAGAAAATTGGAATGGGATTAACCGGATTTGGAGTGTTTTTCCTGTTCTTTGGA 
ATGATTCTCTTTTTTGACAAAGGACTACTGGCTATTGGAAATGTTTTATTTGTAGCCGGCTT 
GGCTTTTGTAATTGGTTTAGAAAGAACATTCAGATTCTTCTTCCAAAAACATAAAATGAAAG 
§ CTACAGGTTTTTTTCTGGGTGGTGTATTTGTAGTCCTTATTGGTTGGCCTTTGATAGGCATG 
ATCTTCGAAATTTATGGATTTTTTCTCTTGTTC 
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FIGURE 92 

GGCACGAGGCTGAACCCAGCCGGCTCCATCTCAGCTTCTGGTTTCTAAGTCCATGTGCCAAA 

GGCTGCCAGGAAGGAGACGCCTTCCTGAGTCCTGGATCTTTCTTCCTTCTGGAAATCTTTGA 

CTGTGGGTAGTTATTTATTTCTGAATAAGAGCGTCCACGCATCATGGACCTCGCGGGACTGC 

TGAAGTCTCAGTTCCTGTGCCACCTGGTCTTCTGCTACGTCTTTATTGCCTCAGGGCTAATC 

ATCAACACCATTCAGCTCTTCACTCTCCTCCTCTGGCCCATTAACAAGCAGCTCTTCCGGAA 

GATCAACTGCAGACTGTCCTATTGCATCTCAAGCCAGCTGGTGATGCTGCTGGAGTGGTGGT 

CGGGCACGGAATGCACCATCTTCACGGACCCGCGCGCCTACCTCAAGTATGGGAAGGAAAAT 

GCCATCGTGGTTCTCAACCACAAGTTTGAAATTGACTTTCTGTGTGGCTGGAGCCTGTCCGA 

ACGCTTTGGGCTGTTAGGGGGCTCCAAGGTCCTGGCCAAGAAAGAGCTGGCCTATGTCCCAA 

TTATCGGCTGGATGTGGTACTTCACCGAGATGGTCTTCTGTTCGCGCAAGTGGGAGCAGGAT 

CGCAAGACGGTTGCCACCAGTTTGCAGCACCTCCGGGACTACCCCGAGAAGTATTTTTTCCT 

GATTCACTGTGAGGGCAGACGGTTCACGGAGAAGAAGCATGAGATCAGCATGCAGGTGGCCC 

GGGCCAAGGGGCTGCCTCGCCTCAAGCATCACCTGTTGCCACGAACCAAGGGCTTCGCCATC 

ACCGTGAGGAGCTTGAGAAATGTAGTTTCAGCTGTATATGACTGTACACTCAATTTCAGAAA 

TAATGAAAATCCAACACTGCTGGGAGTCCTAAACGGAAAGAAATACCATGCAGATTTGTATG 

TTAGGAGGATCCCACTGGAAGACATCCCTGAAGACGATGACGAGTGCTCGGCCTGGCTGCAC 

AAGCTCTACCAGGAGAAGGATGCCTTTCAGGAGGAGTACTACAGGACGGGCACCTTCCCAGA 

GACGCCCATGGTGCCCCCCCGGCGGCCCTGGACCCTCGTGAACTGGCTGTTTTGGGCCTCGC 

TGGTGCTCTACCCTTTCTTCCAGTTCCTGGTCAGCATGATCAGGAGCGGGTCTTCCCTGACG 

CTGGCCAGCTTCATCCTCGTCTTCTTTGTGGCCTCCGTGGGAGTTCGATGGATGATTGGTGT 

GACGGAAATTGACAAGGGCTCTGCCTACGGCAACTCTGACAGCAAGCAGAAACTGAATGACT 

GACTCAGGGAGGTGTCACCATCCGAAGGGAACCTTGGGGAACTGGTGGCCTCTGCATATCCP 

CCTTAGTGGGACACGGTGACAAAGGCTGGGTGAGCCCCTGCTGGGGACGGCGGAAGTCACGA 

CCTCTCCAGCCAGGGAGTCTGGTCTCAAGGCCGGATGGGGAGGAAGATGTTTTGTAATCTTT 

TTTTCCCCATGTGCTTTAGTGGGCTTTGGTTTTCTTTTTGTGCGAGTGTGTGTGAGAATGGC 

TGTGTGGTGAGTGTGAACTTTGTTCTGTGATCATAGAAAGGGTATTTTAGGCTGCAGGGGAG 

GGCAGGGCTGGGGACCGAAGGGGACAAGTTCCCCTTTCATCCTTTGGTGCTGAGTTTTCTGT 

AACCCTTGGTTGCCAGAGATAAAGTGAAAAGTGCTTTAGGTGAGATGACTAAATTATGCCTC 

CAAGAAAAAAAAATTAAAGTGCTTTTCTGGGTCAAAAAAAAAAAA 
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FIGURE 93 

MDLiAGLjLKSQFIjCHLVFCYVFIASGLI INTIQLFTLLLWPINKQLiFRKINCRLSYCISSQLV 
MLLEWWSGTECTIFTDPRAYLKYGKENAIWLNHKFEIDFLCGWSLSERFGLLGGSKVLAKK 
ELAYVP 1 1 GWMWYFTEMVFCSRKWEO np KTVATS LQHLRD YP EKYF FL I HCEGTRFTEKKHE 
ISMQVARAKGLPRLKHHLLPRTKGFAITVRSLRNWSAVYDCTLNFRNNENPTLLGVLNGKK 
YHADLYVRRIPLEDIPEDDDECSAWLHKLYQEKDAFQEEYYRTGTFPETPMVPPRRPWTLVN 
WLFWASLVLYPFFQFLVSMIRSGSSLTLASFILVFF\ r ASVGVRWMIGVTEIDKGSAYGNSDS 
KQKLND 
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FIGURE 94 



CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCG 
GCGCACTCGCTTTCCAGCACCTCAACACGGACTGGGACACGGAAGGTTTTCTTCTTGGGGAA 
GTAAAAGGTGAAGCCAAGAACAGCATTACTGATTCCCAAATGGATGATGTTGAAGTTGTTTA 
TACAATTGACATTCAGAAATATATTCCATGCTATCAGCTTTTTAGCTTTTATAATTCTTCAG 
| GCGAAGTAAATGAGCAAGCACTGAAGAAAATATTATCAAATGTCAAAAAGAATGTGGTAGGT 
TGGTACAAATTCCGTCGTCATTCAGATCAGATCATGACGTTTAGAGAGAGGCTGCTTCACAA 
AAACTTGCAGGAGCATTTTTCAAACCAAGACCTTGTTTTTCTGCTATTAACACCAAGTATAA 
TAAC^GAAAGCTGCTCTACTCATCGACTGGAACATTCCTTATATAAACCTCAAAAAGGACTT 
TTTCACAGGGTACCTTTAGTGGTTGCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAAC 
TGTATCAGGTTCCTGTATGTCCACTGGTTTTAGCCGAGCAGTACAAACACACAGCTCTAAAT 
TTTTTGAAGAAGATGGATCCTTAAAGGAGGTACATAAGATAAATGAAATGTATGCTTCATTA 
CAAGAGGAATTAAAGAGTATATGCAAAAAAGTGGAAGACAGTGAACAAGCAGTAGATAAACT 
AGTAAAGGATGTAAACAGATTAAAACGAGAAATTGAGAAAAGGAGAGGAG CACAGATT CAGG 
CAG C AAG AG AGAAG AAC ATCC AAAAAGA C C CT C A G G AG AA C A TTTTTCTTTGT C AG G C ATT A 
CGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTTATGTCTTTAAAAAATAGACA 
TGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGTAGACAATCTGACCTTAA 
TGGT AG AAC ACAC TGACATT CC TGAAGCTAGT C C AG CT AG TA CAC C AC AAAT C ATTAAGC AT 
AAAGCCTTAGACTTAGATGACAGATGGCAATTCAAGAGATCTCGGTTGTTAGATACACAAGA 
CAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAATGAGCAGCC 
CAGAAACAGATGAAGAAATTGAAAAGATGAAGGGTTTTGGTGAATATTCACGGTCTCCTACA 
TTTTGATCCTTTTAACCTTACAAGGAGATTTTTTTATTTGGCTGATGGGTAAAGCCAAACAT 
TTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCAC 
CTGTTTGCAGTAATACACAGATAACTCTTAGTGCATTTACTTCACAAAGTACTTTTTCAAAC 
ATCAGATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCT 
TACACAGACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAA 
TCCCAGCACTTAGGGAAGACAAGTCAGGAGGATTGATTGAAGCTAGGAGTTAGAGACCAGCC 
TGGGCAACGTATTGAGACCATGTCTATTAAAAAATAAAATGGAAAAGCAAGAATAGCCTTAT 
TTTCAAAATATGGAAAGAAATTTATATGAAAATTTATCTGAGTCATTAAAATTCTCCTTAAG 
TGATACTTTTTTAGAAGTACATTATGGCTAGAGTTGCCAGATAAAATGCTGGATATCATGCA 
ATAAATTTGCAAAACATCATCTAAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 95 

MEGESTSAVL.SGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQ 
KYIPCYQLFSFYNSSGEWEQALKKILSNVKKNWGWYKFRRHSDQIMTFRERLLHKNLQEH 
FSNQDLVFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSC 
MSTGFSRAVQTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVN 
RLKREIEKRRGAQIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSS 
CNYNHHLDWDNLTLMVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKA 
NTGSSNQDKASKMSSPETDEEIEKMKGFGEYSRSPTF 
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FIGURE 96 

GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCA 
GCGCAGGGCAGCCCAAGCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCC 
CCGCGGCGCCTGCCACCCTTCCCTCCTTCCCCGCGTCCCCGCCTCGCCGGCCAGTCAGCTTG 
CCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCTGCTTCCCTGGGCCG 
CGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCT 
GACGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCA 
ACTCCAACTCCTTCTCCCTCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCC 
GCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAAGGTGGGAACGCGTCCGCCCCGGCCCGCA 
CCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTCAGCGCCGCGCTG 
CTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGG 
CTTCAACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCC 
AGGGTTCTACCTGGTGCTCTCAAGAGATGGAGGAGAAGTACAGCCTGCAAAGTAAAGATGAT 
TT C AAAAGTGTGGT C AGCGAACAGTG CAATCATTTG CAAGCTGTCTTTGCTT CACGTTACAA 
GAAGTTTGATGAATTCTTCAAAGAACTACTTGAAAATGCAGAGAAATCCCTGAATGATATGT 
TTGTGAAGACATATGGCCATTTATACATGCAAAATTCTGAGCTATTTAAAGATCTCTTCGTA 
GAGTTGAAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTG 
GGCTCGCCTCCTGGAGCGGATGTTCCGCCTGGTGAACTCCCAGTACCACTTTACAGATGAGT 
ATCTGGAATGTGTGAGCAAGTATACGGAGCAGCTGAAGCCCTTCGGAGATGTCCCTCGCAAA 
TTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGCTTAGCGGT 
TGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCC 
TGTTGAAGATGATCTACTGCTCCCACTGCCGGGGTCTCGTGAGTGTGAAGCCATGTTACAAC 
TACTGCTCAAACATCATGAGAGGCTGTTTGGCCAACCAAGGGGATCTCGATTTTGAATGGAA 
CAATTTCATAGATGCTATGCTGATGGTGGCAGAGAGGCTAGAGGGTCCTTTCAACATTGAAT 
CGGTCATGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACATGCAGGATAATAGT 
GTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACG 
AATTTCTCGTTCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGG 
AACGCCCAACCACAGCAGCTGGCACTAGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAA 
CTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAACGTTTGCAACGATGAGAGGAT 
GGCTGCAGGAAACGGCAATGAGGATGACTGTTGGAATGGGAAAGGCAAAAGCAGGTACCTGT 
TTGCAGTGACAGGAAATGGATTAGCCAACCAGGGCAACAACCCAGAGGTCCAGGTTGACACC 
AGCAAACCAGACATACTGATCCTTCGTCAAATCATGGCTCTTCGAGTGATGACCAGGAAGAT 
GAAGAATG CATACAATGGGAACGACGTGGACTTCTTTG ATATCAGTGATG AAAGTAGTGGAG 
AAGGAAGTGGAAGTGGCTGTGAGTATCAGCAGTGCCCTTCAGAGTTTGACTACAATGCCACT 
GACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCACA 
GGCCTACCTCCl'CACTGTCTTCTGCATCTTGTTCCTGGTTATGCAGAGAGAGTGGAG ATAA T 
TCTCAAACTCTGAGAAAAAGTGTTCATCAAAAAGTTAAAAGGCACCAGTTATCACTTTTCTA 
CCATCCTAGTGACTTTGCTTTTTAAATGAATGGACAACAATGTACAGTTTTTACTATGTGGC 
CAGTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGGAGGAAAAGGGACTGTG 
CATTGAGTTGGTTCCTGCTCCCCCAAACCATGTTAAACGTGGCTAACAGTGTAGGTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTC 
ATTTCGTTTGTGGGTTTTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAG 
GGTCCCTTCTTGGCACGTAACATGTACGTATTTCTGAAATATTAAATAGCTGTACAGAAGCA 
GGTTTTATTTATCATGTTATCTTATTAAAAGAAAAAGCCCAAAAAGC 
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FIGURE 97 

MARFGLPALLCTLAVLSAALLAAELKSKSCSEVRRLYVSKGFMKNDAPLHEINGDHLKICPQ 
GSTCCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMF 
VKTYGHLYMQlsrSELFKDLFVELKRYYWGNVNLEEMLNDFWARLLERMFRLVNSQYHFTDEy 
LECVSKYTEQLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKVSWNPTAQCTHAL 
LKMIYCSHCRGLVTVKPCYNYCSNIMRGCLANQGDLDFEWITOFIDAMLMVAERLEGPFNIES 
VMDPIDVKISDAIMNMQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEE 
RPTTAAGTSLDRLVTDVKEKLKQAKKFWSSLPSWCNDERMAAGNGNEDDCWNGKGKSRYLF 
AVTGNGLANQGNNPEVQVDTSKPDILI^ 

GSGSGCEYQQCPSEFDYNATDHAGKSANEKADSAGVRPGAQAYLLTVFCILFLVMQREWR 
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FIGURE 98 

CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCT 
GACCTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAA 
GCAACTTACAGCTGCACCGACAGTTGu.(jATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGC 
TGCCACTAATGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCAC 
AGGGACCGAGGCCAGGCTTCTAGGAGATGGCTCCAGGAA,GGCGGCCAAGAATGTGAGTGCAA 
AGATTGGTTCCTGAGAGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGC 
AGTGCCCCTGTGATCATTTCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGA 
AAG C C AAAC AAGC AT T C C AGAG C C TG C ' AGCAATTTCTCAAACAATGTCAGCTAAGAAGCTT 
TGCTCTGCCTTTGTAGGAGCTCTGAGCGCCCACTCTTCCAATTAAACATTCTCAGCCAAGAA 
GACAGTGAGCACACCTACCAGACACTCT'i-CTTCTCCCACCTCACTCTCCCACTGTACCCACC 
CCTAAATCATTCCAGTGCTCTCAAAAAGCATGTTTTTCAAGATCATTTTGTTTGTTGCTCTC 
TCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCTGTGCCCTCCCCTTACCCAGGCTTAGGCTT 
AATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCTAGCTAGTGTCATTTAACCTTAAATGC 
AATCAGGAAAGTAGCAAACAGAAGTCAATAAATATTT^ 
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/""^VO 

FIGURE 99 

MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRR 
KFMTVSGLPKKQCPCDHFKGWKKTRHQRHHRKPNKHSRACQQFLKQCQLiRSFALPL 
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FIGURE 100 

AATG GCTGTCTTAGTACTTCGCCTGACAGTTGTCCTGGGACTGCTTGTCTTATTCCTGACCT 
GCTATGCAGACGACAAACCAGACAAGCCAGACGACAAGCCAGACGACTCGGGCAAAGACCCA 
AAGCCAGACTTCCCCAAATTCCTAAGCCTCCTGGGCACAGAGATCATTGAGAATGCAGTCGA 
GTTCATCCTCCGCTCCATGTCCAGGAGCACAGGATTTATGGAATTTGATGATAATGAAGGAA 
AACATTCATCAAAGTGACATCCTCAGGACACACCCATGTGGCTCCTGGACAATCCAAGAGCA 
GCCAAATCCTGCTTTTCCAGTTTGGCTCCACAAGTCCTCCAGGACAGAGCCCTCAAAGCAAC 
TCCCAACGAGTTCTCAGGATTCAGGCTCTGGCTTCAACCAAACAGAACTCATTTTGAACACC 
CTGACTGCATTTTTGCTTTTAGAAAGTTAGAATAAATATGGCGCTTTGGGATCACATAGTTG 
ATGGAGAGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
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FIGURE 101 



MAVLVLRLTWLGLLVLFLTCYADDKPDKPDDKPDDSGKDPKPDFPKFLSLLGTEIIENAVE 
F I LRSMSRSTGFME FDDNEGKHS S K 
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FIGURE 102 

GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCT 
CAGAGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTAC 
CCTGCCCCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCC 
CCTACCTGATGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAG 
CTCTTCAGCCAGATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGG 
CTGCGGAACCGGAGCCAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACC 
CAAATCCCCACTTTGAGAAGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATAT 
GAGCGGTTTGTGGTGGCTCCTGGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGT 
GGTGGTCTGCACTCTGGTGCTGTGCTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCC 
GGAGAGTACTGAGACCGGGAGGTGTGCTCTTTTTCTGGGAGCATGTGGCAGAACCATATGGA 
AGCTGGGCCTTCATGTGGCAGCAAGTTTTCGAGCCCACCTGGAAACACATTGGGGATGGCTG 
CTGCCTCACCAGAGAGACCTGGAAGGATCTTGAGAACGCCCAGTTCTCCGAAATCCAAATGG 
AACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTGGGCCCCACATCATGGGAAAGGCTGTC 
AAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCCTTCCCCAGCCTCCAATTAGAACA 
AGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTAGCAGAATGAGAGAAGACATT 
CATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGCAATCTCTAACTTCAATC 
CCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGGAAACACTAGGACCC 
TGTTGTATCCTCAACTGGAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTCCCAATGTTGTC 
CCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACACCCATGCGT 
CTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGACCCTCT 
CTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGGAT 
GCCAGAGCAAGACTCAAAGAGGCAGAGGTTTTGTTCTCAAATATTTTTTAATAAATAGACGA 
AACCACG 
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FIGURE 103 



MDIIiVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPyLMAVLTPKSNRKMESKKRELFSQI 
KGLTGASGKVALL.ELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFW 
APGEDMRQLADGSMDVVVCTLVLCSVQS PRKVLQEVRRVLRPGGVLFFWEHVAE PYGSWAFM 
WQQVFEPTWKHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFP 
SSKALICSFPSLQLEQATHQPIYLPLRGT 
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FIGURE 104 

GTGGGATTTATTTGAGTGCAAGATCGTTTTCTCAGTGGTGGTGGAAGTTGCCTCATCGCAGG 

CAGATGTTGGGGCTTTGTCCGAACAGCTCCCCTCTGCCAGCTTCTGTAGATAAGGGTTAAAA 

ACTAATATTT AT ATG AC AGAAGAAAAAGATGTCATT C CG T AAAGTAAAC AT CATC AT CTTGG 

TCCTGGCTGTTGCTCTCTTCTTACTGGTTTTGCACCATAACTTCCTCAGCTTGAGCAGTTTG 

TTAAGGAATGAGGTTACAGATTCAGGAATTGTAGGGCCTCAACCTATAGACTTTGTCCCAAA 

TGCTCTCCGACATGCAGTAGATGGGAGACAAGAGGAGATTCCTGTGGTCATCGCTGCATCTG 

AAGACAGGCTTGGGGGGGCCATTGCAGCTATAAACAGCATTCAGCACAACACTCGCTCCAAT 

GTGATTTTCTACATTGTTACTCTCAACAATi-^AGCAGACCATCTCCGGTCCTGGCTCAACAG 

TGATTCCCTGAAAAGCATCAGATACAAAATTGTCAATTTTGACCCTAAACTTTTGGAAGGAA 

AAGTAAAGGAGGATCCTGACCAGGGGGAATCCATGAAACCTTTAACCTTTGCAAGGTTCTAC 

TTGCCAATTCTGGTTCCCAGCGCAAAGAAGGCCATATACATGGATGATGATGTAATTGTGCA 

AGGTGATATTCTTGCCCTTTACAATACAGCACTGAAGCCAGGACATGCAGCTGCATTTTCAG 

AAGATTGTGATTCAGCCTCTACTAAAGTTGTCATCCGTGGAGCAGGAAACCAGTACAATTAC 

ATTGGCTATCTTGACTATAAAAAGGAAAGAATTCGTAAGCTTTCCATGAAAGCCAGCACTTG 

CTCATTTAATCCTGGAGTTTTTGTTGCAAACCTGACGGAATGGAAACGACAGAATATAACTA 

ACCAACTGGAAAAATGGATGAAACTCAATGTAGAAGAGGGACTGTATAGCAGAACCCTGGCT 

GGTAGCATCACAACACCTCCTCTGCTTATCGTATTTTATCAACAGCACTCTACCATCGATCC 

TATGTGGAATGTCCGCCACCTTGGTTCCAGTGCTGGAAAACGATATTCACCTCAGTTTGTAA 

AGGCTGCCAAGTTACTCCATTGGAATGGACATTTGAAGCCATGGGGAAGGACTGCTTCATAT 

ACTGATGTTTGGGAAAAATGGTATATTCCAGACCCAACAGGCAAATTCAACCTAATCCGAAG 

ATATACCGAGATCTCAAACATAAAGTGAAACAGAATTTGAACTGTAAGCAAGCATTTCTCAG 

GAAGT C CTGG AAGATAGCATGCATGGG AAGTAACAGTTG CT AGGCTTCAATG C CTAT CGGTA 

GCAAGCCATGGAAAAAGATGTGTCAGCTAGGTAAAGATGACAAACTGCCCTGTCTGGCAGTC 

AGCTTCCCAGACAGACTATAGACTATAAATATGTCTCCATCTGCCTTACCAAGTGTTTTCTT 

ACTACAATGCTGAATGACTGGAAAGAAGAACTGATATGGCTAGTTCAGCTAGCTGGTACAGA 

TAATTCAAAACTGCTGTTGGTTTTAATTTTGTAACCTGTGGCCTGATCTGTAAATAAAACTT 

ACATTTTTC 
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FIGURE 105 



MS FRKVNI I ILVLAVALFLLVLHHNFLS LSSLLRNE VTDS G I VG PQ P I DF VPNALRHAVDGR 
QEE I P WI AASEDRLGGAI AAI NS I QHNTRSNVI FY I VTLNNTADHLRS WLNSDSLKS I RYK 
IVNFDPKLLEGKVKEDPDQGESMKPLTFARFYLPILVPSAKKAIYMDDDVIVQGDILALYNT 
ALKPGHAAAFSEDCDSASTKWIRGAGNQYNYIGYLDYKKERIRKLSMKASTCSFNPGVFVA 
NLTE WKRQN I TNQLEKWMKLNVEEGL YS RTLAGS ITTPPLLI VF YQQHST I DPMWNVRHLGS 
SAGKRYS PQFVKAAKLLHWNGHLKPWGRTAS YTDVWEKWY I PDPTGKFNL I RRYTE ISNTK 
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FIGURE 106 

TGGTTTTTGCCCCATAAATTCCCTCAGCTTGAGCAGTTTGTTAAGGAATGAGGTTACAGATT 
CAGGAATTNTAGGNCCTCAACCTNTAGANTTTGTCCCAAATGTTCTCCGACATGCAGTAGAT 
GGGAGACAAGAGGAGATTCCTGTGGTCATCGCTGCATNTGAAGACAGGCTTGGGGGGGCCAT 
TGCAGCTATAAACAGCATTCAGCACAACACTCGNTCCAATGTGATTTTCTACATTGTTACTC 
TCAACAATACAGCAGACCATNTCCGGTCCTGGNTCAACAGTGATTCCCTGAAAAGCATCAGA 
TACAAAATTGTCAATTTTGACCCTAAACTTTTGGAAGGAAAAGTAAAGGAGGATCCTGACCA 
GGGGGAATCCATGAAACCTTTAACCTTTGCAAGGTTCTACTTGCCAATTCTGGTTCCCAGCG 
CAAAGAAGGCCATATACATGGATGATGATGTAATTGTGCAAGGTGATATTCTTGCCCTTTAC 
AATACAGCACTGAAGCCAGGACATGCAGCTGCATTTTCAGAAGATTGTGATTCAGCCTCTAC 
TAAAGTTGTCATCCGTGGAGCAGGAAA 
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FIGURE 107 



CGACGCTCTAGCGGTTACCGCTGCGGGCTGGCTGGGCGTAGTGGGGCTGCGCGGCTGCCACG 
GAGCTAGAGGGCAAGTGTGCTCGGCCCAGCGTGCAGGGAACGCGGGCGGCCAGACAACGGGC 
TGGGCTCCGGGGCCTGCGGCGCGGGCGCTGAGCTGGCAGGGCGGGTCGGGGCGCGGGCTGCA 
TCCGCATCTCCTCCATCGCCTGCAGTAAGGGCGGCCGCGGCGAGCCTTTGAGGGGAACGACT 
TGTCGGAGCCCTAACCAGGGGTGTCTCTGAGCCTGGTGGGATCCCCGGAGCGTCACATCACT 
TTCCGATCACTTCAAAGTGGTTAAAAACTAATATTTATATGACAGAAGAAAAAGATGTCATT 
CCGTAAAGTAAACATCATCATCTTGGTCCTGGGCTGTTGCTCTCTTCTTACTGGTTTTGCAC 
CATAACTT C C T C AG CTTGAGGC AGTTTGTTAAGG AATGAGGTTACAGATT C AGG AATTGT AG 
GGCCTCAACCTATAGGACTTTGTCCCAAATGCTCTCCGACATGCAGTAGATGGGAGACAAGA 
GGAGATTCCTGTGGTCATCGCTGCATCTGAAGACAGGCTTGGGGGGGCCATTGCAGCTATAA 
ACAGCATTCAGCACAACACTCGci'UCAATGTGATTTTCTACATTGTTACTCTCAACAATACA 
GCAGACCATCTCCGGTCCTGGGCTCAACAGTGATTCCCTGAAAAGCATCAGATACAAAATTG 
TCAATTTTGACCCTAAACTTTTGGAAGGAAAAGTAAAGGAGGATCCTGACCAGGGGGAATCC 
g. ATGAAACCTTTAACCTTTGCAAGGTTCTACTTGCCAATTCTGGGTTCCCAGCGCAAAGAAGG 
CCATATACATGGATGATGATGTAATTGTGCAAGGTGATATTCTTGCCCTTTACAATACAGCA 
CTGAAGCCAGGACATGCAGCTGCATTTTCAGAAGATTGTGATTCAGCCTCTACTAAAGTTGT 
CATCCGTGGAGCAGGAAACCAGTACAATTACATTGGCTATCTTGACTATAAAAAGGAAAGAA 
TTCGTAAGCTTTCCATGAAAGCCAGCACTTGCTCATTTAATCCTGGAGTTTTTGTTGGAAAC 
CTGACGGAATGGAAACGACAGAATATAACTAACCAACTGGAAAAATGGATGAAACTCAATGT 
AGAAGAGGGACTGTATAGCAGAACCCTGGCTGGTAGCATCACAACACCTCCTCTGCTTATCG 
TATTTTATCAACAGCACTCTACCATCGATCCTATGTGGAATGTCCGCCACCTTGGTTCCAGT 
GCTGGAAAACGATATTCACCTCAGTTTGTAAAGGCTGCCAAGTTACTCCAT f GGAATGGACA 
TTTGAAGCCATGGGGAAGGACTGCTTCATATACTGATGTTTGGGGAAAAATGGTATATTCCA 
GACCCAACAGGCAAATTCAACCTAATCCGAAGATATACCGAGATCTCAAAC^TAAAGTGAAA 
CAGAATTTGAACTGTAAGCAAGCATTTCTCAGGAAGTCCTGGAAGATAGCATGCGTGGGAAG 
TAACAGTTGCTAGGCTTCAATGCCTATCGGTAGCAAGCCATGGAAAAAGATGTGTCAGCTAG 
GTAAAGATGACAAACTGCCCTG77TGGCAGTCAGCTTCCCAGACAGACTATAGACTATAAAT 
ATGTCTCCATCTGCCTTACCAAGTGTTTTCTTACTACAATGCTGAATGACTGGAAAGAAGAA 
CTGATATGGCTAGTTCAGCTAGCTGGTACAGATAATTCAAAACTGCTGTTGGTTTTAATTTT 
m GTAACCTGTGGCCTGATCTGTAAATAAAACTTACATTTTTCAATAGGTAAA^ AAAAAAAAAA 

AAAAAA 
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FIGURE 108 



CTGCAGGTAGACATCTCCACTGCCCAGGAATCACTGAGCGTGCAGACAGCACAGCCTCCTCT 
GAAGGCCGGCCATACCAGAGTCCTGCCTCGGCATGGGCCTCACCATTGAGGCAGCTCCACTG 
TCTGTGCTGGTCTGAGGGTGCTGCCTGTCATGGGGGCAGCCATCTCCCAGGGGGCCCTCATC 
GCCATCGTCTGCAACGGTCTCGTGGGCTTCTTGCTGCTGCTGCTCTGGGTCATCCTCTGCTG 

It GGCCTGCCATTCTCGTCTGCCGACGTTGACTCTCTCTCTGAATCCAGTCCCAACTCCAGCCC 
TGGCCCCTGTCCTGAGAAGGCCCCACCACCCCAGAAGCCCAGCCATGAAGGCAGCTACCTGC 
TGCAGCCCTGAAGGCCCCTGGCCTAGCCTGGAGCCCAGGACCTAAGTCCACCTCACCTAGAG 
CCTGGAATTAGGATCCCAGAGTTCAGCCAGCCTGGGGTCCAGAACTCAAGAGTCCGCCTGCT 
TGGAGCTGGACCCAGCGGCCCAGAGTCTAGCCAGCTTGGCTCCAATAGGAGCTCAGTGGCCC 
TAAGGAGATGGGCCTGGGGTGGGGGCTTATGAGTTGGTGCTAGAGCCAGGGCCATCTGGACT 
ATGCTCCATCCCAAGGGCCAAGGGTCAGGGGCCGGGTCCACTCTTTCCCTAGGCTGAGCACC 
TCTAGGCCCTCTAGGTTGGGGAAGCAAACTGGAACCCATGGCAATAATAGGAGGGTGTCCAG 

P GCTGGGCCCCTCCCCTGGTCCTCCGAGTGTTTGCTGGATAATAAATGGAACTATGGCTCT7^A 
AAAAAAAAAAAAAAAAA 
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FIGURE 109 

MGAAISQGALIAIVCNGLVGFLL.LLLWVILCWACHSRLPTLTLSLNPVPTPALAPVLRRPHH 
PRSPAMKAATCCSPEGPWPSLEPRT 
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FIGURE 110 

GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCA 
GTTCCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTA 
CTCCCTATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAA 
TCATGTCGGGAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCC 
ATGATGTTTACCTTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTAT 
TTTGGGATTGTTGTTTGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACC 
TCAGCATAGAATTGGACACAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTA 
TCCACAGGCATCACGGCAGTGCTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATT 
GACAGTTGAGCTTTTCCAAATCACAAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCC 
AGCCACTGTGGACATTTGCCATCCTCATTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTG 
AGCCTGGGAACTGCAGGAGCTGCCCAGGTTATGGAAGGCGGCCAAGTGGAATATAAGCCCCT 
TTCGGGCATTCGGTACATGTGGTCGTACCATTTAATTGGCCTCATCTGGACTAGTGAATTCA 
TCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAGTGGTTACTTGTTATTTCAACAGAAGT 
AAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTCTCCATTCTCTTCTTCTACCATCA 
AGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAGGATTCCGAGAATCATTGTCA 
TGTACATGCAAAACGCACTGAAAGAACAGCAGCATGGTGCATTGTCCAGGTACCTGTTCCGA 
TGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTCAACCAGAATGCATA 
TACTACAACTGCTATTAATGGGACAGATTTCTGTACATCAGCAAAAGATGCATTCAAAATCT 
TGTCCAAGAACTCAAGTCACTTTACATCTATTAACTGCTTTGGAGACTTCATAATTTTTCTA 
GGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTACAATCG 
GGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGCCC 
ATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGAT 
CTGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCAAGAATTTCTGAGTTT 
CGTAAAAAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACTCATTAAGGA 
ATGAGGAGGGAACAGAACTCCAGGCCATTGTGAGATAGATACCCATTTAGGTATCTGTACCT 
GGAAAACATTTCCTTCTAAGAGCCATTTACAGAATAGAAGATGAGACCACTAGAGAAAAGTT 
AGTGAATTTTTTTTTAAAAGACCTAATAAACCCTATTCTTCCTCAAAlA 
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FIGURE 111 

MS GRDT I LGLC I L ALALS LAMMFTFRF I TTLLVH IFISLVI LGL LF VCGVLWWL YYD YTNDL 
SIELDTERENMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQ 
PLWTFAILIFFWVIiWVAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFI 

IjACQQMT I agawtcyfnrs kndp pdhp I lssls I lffyhqgtwkgs fliswri pri I vm 

YMQNALKEQQHGALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKIL 
SKNSSHFTSINCFGDFIIFLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAH 
SFIiSVFETVLDALFLCFAVDLETNDGSSEKPYFMDQEFLSFVKRSNKLNKARAQQDKHSIiRN 
EEGTELQAIVR 
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FIGURE 112 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTT 
AGAATAATTTGTATGGGATTTGTGATGCAGGAAAGCCTAAGGGAAAAAGAATATTCATTCTG 
TGTGGTGAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATT 
TATGAGGACTGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGG 
TGACTGGAGTACATTCAAACAAAGAAACGGCAAAGAAGATTAAAAGGCCCAAGTTCACTGTG 
CCTCAGATCAACTGCGATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATG 
TCCAGCAGGATGCCAAGACCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACT 
CCAGTGTGTGTGGCGCTGCCGTACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTT 
GTTCGGAAGGTTGCTGGACAGTCTGGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTT 
ATCCCTACCACGATGGAGAGAATCCTTTATCGTCTTAGAAAGTAAACCCAAAAAGGGTGTAA 
CCTACCCATCAGCTCTTACATACTCATCATCG7VAAAGTCCAGCTGCCCAAGCAGGTGAGACC 
ACAAAAGCCTATCAGAGGCCACCTATTCCAGGGACAACTGCACAGCCGGTCACTCTGATGCA 
GCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCACCACCTTGCCAAGGCCATCCCCTTCTG 
CTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGGGCCACAGGAGCCAGGAGATGGAT 
CTCTGGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCCAGAGCTGATCCAGGTATCCA 
AAGGCAAGATCCTTCAGGAGCTGCCTTCCAGAAACCTGTTGGAGCGGATGTCAGCCTGGGAC 
TTGTTCCAAAAGAAGAATTGAGCACACAGTCTTTGGAGCCAGTATCCCTGGGAGATCCAAAC 
TGCAAAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAACGGCGATTCCG 
AATCCAGAAGCAGCTCCTGGCTCAIGTTGCCCAAGCTCTTGACATTGGCCCTGCCGGTCCAC 
TGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACACACACG 
AATTCTCGAGATCTGAAGACAGCCATAGAGAAAATTACTCAGAGAGGAGGACTTTCTAATGT 
AGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCG 
GGGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCT 
TCAAGACTTGCGAGAGAGTCAGGAATCAACATTTTCTTCATCACCATTGAAGGTGCTGCTGA 
AAATGAGAAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACG 
GCTTCTACTCGCTCCACGTGCAGAGCTGGTTTGGCCTCCAGAAGACCCTGCAGCCTCTGGTG 
AAGCGGGTCTGCGACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACAT 
TGGCTTCGTCATCGACGGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGT 
TTGTGACCAACCTCACCAAAGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTG 
CAGTACACCTACGAACAGCGGCTGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACAT 
CCTCAACGCCATCAAGAGGGTGGGCTACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCA 
ACTTCGCCCTGGAGCAGCTCTTCAAGAAGTCCAAGCCCAACAAGAGGAAGTTAATGATCCTC 
ATCACCGACGGGAGGTCCTACGACGACGTCCGGATCCCAGCCATGGCTGCCCATCTGAAGGG 
AGTGATGACCTATGCGATAGGCGTTGCCTGGGCTGCCCAAGAGGAGCTAGAAGTCATTGCCA 
CTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGAGTTTGACAACCTCCATCAGTATGTC 
CCCAGGATGATCCAGAACATTTGTAGAGAGTTCAACTCACAGCCTCGGAACTGAATTCAGAG 
CAGGCAGAGCACCAGCAAGTGCTGCTTTACTAACTGACGTGTTGGACCACCCCACCGCTTAA 
TGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAACAAATGTCTTGTTATTA 
TTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGATGATCACAAACGT 
ATAGAATGAGCCAAAAGGCTACATCATGTTGAGGGTGCTGGAGATTTTACA'xTTTGACAATT 
GTTTTCAAAATAAATGTTCGGAATACAGTGCAGCCCTTACGACAGGCTTACGTAGAGCTTTT 
GTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCATTTTT 
GTCATGACAATGTAGGAATTGCTGAATTAAATGTTTAGAAGGATGAAAAATAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAi^AAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 113 



MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKC 
PAGCQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSL 
SL»PRWRES F I VLES KP KKGVTYPS ALTYSS SKS PAAQAGE TT KAYQRP P I PGTTAQP VTLMQ 
LLAVTVAVATPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQ 
RQDPSGAAFQKPVGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFR 
IQKQLLADVAQALDIGPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNV 
GRAISFVTKNFFSKANGNRSGAPNVVVVMVD^ 

NEKQYWEPNFANKAVCRTNGFYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADI 
GFVIDGSSSVGTGNFRTVLQFVTNLTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDI 
LNAIKRVGYWSGGTSTGAAINFALEQLFKKSKPNKRKLMILITDGRSYDDVRIPAMAAHLKG 
VI TYA I GVAWAAQEELEVI ATHPARDHS FFVDE FDNLHQ YVPR 1 1 QNI CTEFNSQPRN 
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FIGURE 114 

CAGGATGAACTGGTTGCAGTGGCTGCTGCTGCTGCGGGGGCGCTGAGAGGACACGAGCTCTA 
TGCCTTTCCGGCTGCTCATCCCGCTCGGCCTCCTGTGCGCGCTGCTGCCTCAGCACCATGGT 
GCGCCAGGTCCCGACGGCTCCGCGCCAGATCCCGCCCACTACAGTTTTTCTCTGACTCTAAT 
TGATGCACTGGACACCTTGCTGATTTTGGGGAATGTCTCAGAATTCCAAAGAGTGGTTGAAG 
TGCTCCAGGACAGCGTGGACTTTGATATTGATGTGAACGCCTCTGTGTTTGAAACAAACATT 
CGAGTGGTAGGAGGACTCCTGTCTGCTCATCTGCTCTCCAAGAAGGCTGGGGTGGAAGTAGA 
GGCTGGATGGCCCTGTTCCGGGCCTCTCCTGAGAATGGCTGAGGAGGCGGCCCGAAAACTCC 
TCCCAGCCTTTCAGACCCCCACTGGCATGCCATATGGAACAGTGAACTTACTTCATGGCGTG 
AACCCAGGAGAGACCCCTGTCACCTGTACGGCAGGGATTGGGACCTTCATTGTTGAATTTGC 
CACCCTGAGCAGCCTCACTGGTGACCCGGTGTTCGAAGATGTGGCCAGAGTGGCTTTGATGC 
GCCTCTGGGAGAGCCGGTCAGATATCGGGCTGGTCGGCAACCACATTGATGTGCTCACTGGC 
AAGTGGGTGGCCCAGGACGCAGGCATCGGGGCTGGCGTGGACTCCTACTTTGAGTACTTGGT 
GAAAGGAGCCATCCTGCTTCAGGATAAGAAGCTCATGGCCATGTTCCTAGAGTATAACAAAG 
CCATCCGGAACTACACCCGCTTCGATGACTGGTACCTGTGGGTTCAGATGTACAAGGGGACT 
GTGTCCATGCCAGTCTTCCAGTCCTTGGAGGCCTACTGGCCTGGTCTTCAGAGCCTCATTGG 
AGACATTGACAATGCCATGAGGACCTTCCTCAACTACTACACTGTATGGAAGCAGTTTGGGG 
GGCTCCCGGAATTCTACAACATTCCTCAGGGATACACAGTGGAGAAGCGAGAGGGCTACCCA 
CTTCGGCCAGAACTTATTGAAAGCGCAATGTACCTCTACCGTGCCACGGGGGATCCCACCCT 
CCTAGAACTCGGAAGAGATGCTGTGGAATCCATTGAAAAAATCAGCAAGGTGGAGTGCGGAT 
TTGCAACAATCAAAGATCTGCGAGACCACAAGCTGGACAACCGCATGGAGTCGTTCTTCCTG 
GCCGAGACTGTGAAATACCTCTACCTCCTGTTTGACCCAACCAACTTCATCCACAACAATGG 
GTCCACCTTCGACGCGGTGATCACCCCCTATGGGGAGTGCATCCTGGGGGCTGGGGGGTACA 
TCTTCAACACAGAAGCTCACCCCATCGACCTTGCCGCCCTGCACTGCTGCCA^AGGCTGAAG 
GAAGAGCAGTGGGAGGTGGAGGACTTGATGAGGGAATTCTACTCTCTCAAACGGAGCAGGTC 
GAAATTTCAGAAAAACACTGTTAGTTCGGGGCCATGGGAACCTCCAGCAAGGCCAGGAACAC 
TCTTCTCACCAGAAAACCATGACCAGGCAAGGGAGAGGAAGCCTGCCAAACAGAAGGTCCCA 
CTTCTCAGCTGCCCCAGTCAGCCCTTCACCTCCAAGTTGGCATTACTGGGACAGGTTTTCCT 
AGACTCCTCATAACCACTGGATAATTTTTTTATTTTTATTTTTTTGAGGCTAAACTATAATA 
AATTGCTTTTGGCTATCATAAAA 
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FIGURE 115 

MPFRLLI PLGLLCALLPQHHGAPGPDGSAPDPAHYS FSLTLI DALDTLLI LGNVSEFQRVVE 
VL.QDSVDFDIDVNASVFETNIRWGGLLSAHLLSKKAGVEVEAGWPCSGPLLRMAEEAARKL 
L PAFQTPTGMP YGTVNLLHG VNPGETP VTCTAG I GT F I VE FATLSS LTGD PVFE DVARVALM 
RLWESRSDIGLVGNHIDVLTGKWVAQDAGIGAGVDSYFEYLVKGAILLQDKKLMAMFLEYHK 

^. AIRNYTRFDDWYLWVQMYKGTVSMPVFQSLEAYWPGLQSLIGDIDNAMRTFLNYYTVWKQFG 
GLPEFYNIPQGYTVEKREGYPLRPELIESAMYLYRATGDPTLLELGRDAVESIEKISKVECG 
FATIKDLRDHKLDNRMESFFLAETVKYLYLLFDPTNFIHNNGSTFDAVITPYGECILGAGGY 

4 IFNTEAHPIDLAALHCCQRLKEEgWEVEDLI^^EFYSLKRSRSKFQKNTVSSGPWEPPARPGT 
LFSPENHDQARERKPAKQKVPLLSCPSQPFTSKLALLGQVFLDSS 
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FIGURE 116 

AAAGTTACATTTTCTCTGGAACTCTCCTAGGCCACTCCCTGCTGATGCAACATCTGGGTTTG 

GGCAGAAAGGAGGGTGCTTCGGAGCCCGCCCTTTCTGAGCTTCCTGGGCCGGCTCTAGAACA 

ATTCAGGCTTCGCTGCGAGTCAGACCTCAGCTCCAACATATGCATTCTGAAGAAAGATGGCT 

GAGATGGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGGTCAAACTGAGTCTACCA 

AATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCTTTTCATGTGGTTTTTCT 

ACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGCCTGCCCCTCAGAACCTC 

TCTGTACTCTCAACCAACATGAAGCATGTCTTGATGTGGAGCCCAGTGATCGCGCCTGGAGA 

AACAGT^TACTATTCTCaTCGAATACCAGGGGGAGTACGAGAGCCTGTACACGAGCCACATCT 

GGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTGATGTCACTGATGACATC 

ACGGCCACTGTGCCATACAACCTTCGTGTCAGGGCCACATTGGGCTCACAGACCTCAGCCTG 

GAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTACCCGACCTGGGATGGAGA 

TCACCAAAGATGGCTTCCACCTGGTTATTGAGCTGGAGGACCTGGGGCCCCAGTTTGAGTTC 

CTTGTGGCCTACTGGAGGAGGGAGCCTGGTGCCGAGGAACATGTCAAAATGGTGAGGAGTGG 

GGGTATTCCAGTGCACCTAGAAACCATGGAGCCAGGGGCTGCATACTGTGTGAAGGCCCAGA 

CATTCGTGAAGGCCATTGGGAGGTACAGCGCCTTCAGCCAGACAGAATGTGTGGAGGTGCAA 

GGAGAGGCCATTCCCCTGGTACTGGCCCTGTTTGCCTTTGTTGGCTTCATGCTGATCCTTGT 

GGTCGTGCCACTGTTCGTCTGGAAAATGGGCCGGCTGCTCCAGTACTCCTGTTGCCCCGTGG 

TGGTCCTCCCAGACACCTTGAAAATAACCAATTCACCCCAGAAGTTAATCAGCTGCAGAAGG 

GAGGAGGTGGATGCCTGTGCCACGGCTGTGATGTCTCCTGAGGAACTCCTCAGGGCCTGGAT 

CTCATAGGTTTGCGGAAGGGCCCAGGTGAAGCCGAGAACCTGGTCTGCATGACATGGAAACC 

ATGAGGGGACAAGTTGTGTTTCTGTTTTCCGCCACGGACAAGGGATGAGAGAAGTAGGAAGA 

GCCTGTTGTCTACAAGTCTAGAAGCAACCATCAGAGGCAGGGTGGTTTGTCTAACAGAACAC 

TGACTGAGGCTTAGGGGATGTGACCTCTAGACTGGGGGCTGCCACTTGCTGGCTGAGCAACC 

CTGGGAAAAGTGACTTCATCCCTTCGGTCCTAAGTTTTCTCATCTGTAATGGGGGAATTACC 

TACACACCTGCTAAACACACACACACAGAGTCTCTCTCTATATATACACACGTACACATAAA 

TACACCCAGCACTTGCAAGGCTAGAGGGAAACTGGTGACACTCTACAGTCTGACTGATTCAG 

TGTTTCTGGAGAGCAGGACATAAATGTATGATGAGAATGATCAAGGACTCTACACACTGGGT 

GGCTTGGAGAGCCCACTTTCCCAGAATAATCCTTGAGAGAAAAGGAATCATGGGAGCAATGG 

TGTTGAGTTCACTTCAAGCCCAATGCCGGTGCAGAGGGGAATGGCTTAGCGAGCTCTACAGT 

AGGTGACCTGGAGGAAGGTCACAGCCACACTGAAAATGGGATGTGCATGAACACGGAGGATC 

CATGAACTACTGTAAAGTGTTGACAGTGTGTGCACACTGCAGACAGCAGGTGAAATGTATGT 

GTGCAATGCGACGAGAATGCAGAAGTCAGTAACATGTGCATGTTTGTTGTGCTCCTTTTTTC 

TGTTGGTAAAGTACAGAATTCAGCAAATAAAAAGGGCCACCCTGGCCAAAAGCGGTAAAAAA 

AAAAAAAAAA 
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FIGURE 117 

MQTFTMVLEEIWTSLFMWFFYALIPCLLTDEVAILPAPQNLSVLSTISnyiKHLLMWSPVlAPGE 
TVYYSVEYQGEYESLYTSHIWIPSSWCSLTEGPECDVTDDITATVPYNLRVRATLGSQTSAW 
SILKHPPNRNSTILTRPGMEITKDGFHLVIELEDLGPQFEF.LVAYWRREPGAEEHVKMVRSG 
G I PVHLETME PGAA YCVKAQTF VKA I GR YS AFSQTE CVE VQGE A I P LVLALFAFVGFML I LV 
WPLFVWKMGRLLQ YS CC P WVL PDTLK I TNSPQKL I S CRREE VDACAT AVMS PEELLRAW I S 

Important features : 
Signal peptide: 

amino acids 1-29 

Transmembrane domain: 

amino acids 230-255 

N-glycosylation sites. 

amino acids 40-43 and 134-137 

Tissue factor proteins homology. 

amino acids 92-119 



Integrins alpha chain protein homology. 

amino acids 232-262 
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FIGURE 118 

TCCTGCTGATGCACATCTGGGTTTGGCT^AAAGGAGGTTGCTTCGAGCCGCCCTTTCTAGCTT 
CCTGGCCGGCTCTAGAACAATTCAGGCTTCGCTGCGACTAGACCTCAGCTCCAACATATGCA 
TTCTGAAGAAAGATGGCTGAGATGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGG 
TCAAACTGAGTCTACCAAATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCT 
TTTCATGTGGTTTTTCTACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGC 
CTGCCCCTCAGAACCTCTCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCA 
GTGATCGCGCCTGGAGAAACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCT 
GTACACGAGCCACATCTGGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTG 
ATGTCACTGATGACATCACGGCCACTGTGCCATACAACCTTTGTGTCAGGGCCACATTGGGC 
TCACAGACCTCAGCCTGGAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTAC 
CCGACCTGGGATGGAGATCACCAAAGATGGCTTNCACCTGGTTATTGAGCTGGAGGACCTGG 
GGCCCCAGTTTGAGTTCCTTGTGGCCTANTGGAGGAGGGGCGAACCCCTTGCGGCGCAAGGG 
GTTNGCGAACCCCTTGCGGCCGCTGGGGTATCTCTCGAGAAAAGAGAGGCCCAATATGACCCAC 
ATACTCAATATGGACGAANTGCTATTGTCCACCTGTTTGAGTGGCGCTGGGTTGAT 
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FIGURE 119 

CGGACGCGTGGGCCGCCACCTCCGGAACAAGCCATGGTGGCGGCGACGGTGGCAGCGGCGTG 
GCTGCTCCTGTGGGCTGCGGCCTGCGCGCAGCAGGAGCAGGACTTCTACGACTTCAAGGCGG 
TCAACATCCGGGGCAAACTGGTGTCGCTGGAGAAGTACCGCGGATCGGTGTCCCTGGTGGTG 
AATGTGGCCAGCGAGTGCGGCTTCACAGACCAGCACTACCGAGCCCTGCAGCAGCTGCAGCG 
AGACCTGGGCCCCCACCACTTTAACGTGCTCGCCTTGCCCTGCAACCAGTTTGGCCAACAGG 
AGCCTGACAGCAACAAGGAGATTGAGAGCTTTGCCCGCCGCACCTACAGTGTCTCATTCCCC 
ATGTTTAGCAAGATTGCAGTCACCGGTACTGGTGCCCATCCTGCCTTCAAGTACCTGGCCCA 
GACTTCTGGGAAGGAGCCCACCTGGAACTTCTGGAAGTACCTAGTAGCCCCAGATGGAAAGG 
TGGTAGGGGCTTGGGACCCAACTGTGTCAGTGGAGGAGGTCAGACCCCAGATCACAGCGCTC 
GTGAGGAAGCTCATCCTACTGAAGCGAGAAGACTTATAACCACCGCGTCTCCTCCTCCACGA 
CCTCATCCCGCCCACCTGTGTGGGGCTGACCAATGCAAACTCAAATGGTGCTTCAAAGGGAG 
AGACCCACTGACTCTCCTTCCTTTACTCTTATGCCATTGGTCCCATCATTCTTGTGGGGGAA 
AAATTGTAGTATTTTGATTATTTGAATCTTACAGCAACAAATAGGAACTCCTGGCCAATGAG 
AGCTCTTGACCAGTGAATCACCAGGCGATACGAACGTCTTGCCAACAAAAATGTGTGGCAAA 
TAGAAGTATATCAAGCAATAATCTCCCACCCAAGGCTTCTGTAAACTGGGACCAATGATTAC 
CTCATAGGGCTGTTGTGAGGATTAGGATGAAATACCTGTGAAAGTGCCTAGGCAGTGCCAGC 
CAAATAGGAGGCATTCAATGAACATTTTTTGCATATAAACCAAAAAATAAGTTGTTATCAAT 
AAAAACTTGCATCCAACATGAATTTCCAGCCGATGATAATCCAGGCCAAAGGTTTAGTTGTT 
GTTATTTCCTCTGTATTATTTTCTTCATTACAAAAGAAATGCAAGTTCATTGTAACAATCCA 
AACAATACCTCACGATATAAAATAAAAATGAAAGTATCCTCCTCAAAAA 
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FIGURE 120 

IWAATVAAAWLLLWAAACAQQEQDFYDFKAWIRGKLVSLEKYRGSVSLWNVASECGFTD^ 
HYRALQQLQRDLGPHHFNVLAFPCNQFGQQEPDSNKEIESFARRTYSVSFPMFSKIAVTGTG 
AHPAFKYLAQTSGKEPTWNFWKYLVAPDGKWGAWDPTVSVEEVRPQITALVRKLILLKREDL 
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FIGURE 121 

CGGACGCGTGGGCGGGCCGGGACGCAGGGCAAAGCGAGCCATfGGCTGTCTACGTCGGGATGC 
TGCGCCTGGGGAGGCTGTGCGCCGGGAGCTCGGGGGTGCTGGGGGCCCGGGCCGCCCTCTCT 
CGGAGTTGGCAGGAAGCCAGGTTGCAGGGTGTCCGCTTCCTCAGTTCCAGAGAGGTGGATCG 
CATGGTCTCCACGCCCATCGGAGGCCTCAGCTACGTTCAGGGGTGCACCAAAAAGCATCTTA 

'I ACAGCAAGACTGTGGGCCAGTGCCTGGAGACCACAGCACAGAGGGTCCCAGAACGAGAGGCC 
TTGGTCGTCCTCCATGAAGACGTCAGGTTGACCTTTGCCCAACTCAAGGAGGAGGTGGACAA 
AGCTGCTTCTGGCCTCCTGAGCATTGGCCTCTGCAAAGGTGACCGGCTGGGCATGTGGGGAC 

J| CTAACTCCTATGCATGGGTGCTCATGCAGTTGGCCACCGCCCAGGCGGGCATCATTCTGGTG 
TCTGTGAACCCAGCCTACCAGGCTATGGAACTGGAGTATGTCCTCAAGAAGGTGGGCTGCAA 
GGCCCTTGTGTTCCCCAAGCAATTCAAGACCCAGCAATACTACAACGTCCTGAAGCAGATCT 

^ GTCCAGAAGTGGAGAATGCCCAGCCAGGGGCC TGAAGAGTCAGAGGCTCCCAGATCTGACC 

ACAGTCATCTCGGTGGATGCCCCTTTGCCGGGGACCCTGCTCCTGGATGAAGTGGTGGCGGC 
TGGCAGCACACGGCAGCATCTGGACCAGCTCCAATACAACCAGCAGTTCCTGTCCTGCCATG 

"] ACCCCATCAACATCCAGTTCACCTCGGGGACAACAGGCAGCCCCAAGGGGGCCACCCTCTGC 
CACTACAACATTGTCAACAACTCCAACATTTTAGGAGAGCGCCTGAAACTGCATGAGAAGAC 
ACCAGAGCAGTTGCGGATGATCCTGCCCAACCCCCTGTACCATTGCCTGGGTTCCGTGGCAG 
GCACAATGATGTGTCTGATGTACGGTGCCACCCTCATCCTGGCCTCTCCCATCTTCAATGGC 
AAGAAGGCACTGGAGGCCATCAGCAGAGAGAGAGGCACCTTCCTGTATGGTACCCCCACGAT 

.;g GTTCGTGGACATTCTGAACCAGCCAGACTTCTCCAGTTATGACATCTCGACCATGTGTGGAG 

1 GTGTCATTGCTGGGTCCCCTGCACCTCCAGAGTTGATCCGAGCCATCATCAACAAGATAAAT 

^ ATGAAGGACCTGGTGGTTGCTTATGGAACCACAGAGAACAGTCCCGTGACATTCGCGCACTT 
CCCTGAGGACACTGTGGAGCAGAAGGCAGAAAGCGTGGGCAGAATTATGCCTCACAGGGAGG 
CCCGGATCATGAACATGGAGGCAGGGACGCTGGCAAAGCTGAACACGCCCGGGGAGCTGTGC 
ATCCGAGGGTACTGCGTCATGCTGGGCTACTGGGGTGAGCCTCAGAAGACAGAGGAA.GCAGT 
GGATCAGGACAAGTGGTATTGGACAGGAGATGTCGCCACAATGAATGAGCAGGGCTTCTGCA 
AGATCGTGGGCCGCTCTAAGGATATGATCATCCGGGGTGGTGAGAACATCTACCCCGCAGAG 
CTCGAGGACTTCTTTCACACACACCCGAAGGTGCAGGAAGTGCAGGTGGTGGGAGTGAAGGA 
CGATCGGATGGGGGAAGAGATTTGTGCCTGCATTCGGCTGAAGGACGGGGAGGAGACCACGG 
TGGAGGAGATAAAAGCTTTCTGCAAAGGGAAGATCTCTGACTTCAAGATTCCGAAGTACATC 
GTGTTTGTCACAAACTACCCCCTCACCATTTCAGGAAAGATCCAGAAATTCAAACTTCGAGA 
GCAGATGGAACGACATCTAAATCTGTGAATAAAGCAGCAGGCCTGTCCTGGCCGGTTGGCTT 
GACTCTCTCCTGTCAGAATGCAACCTGGCTTTATGCACCTAGATGTCCCCAGCACCCAGTTC 
TG AG C C AGGCAC AT CAAATGTCAAGGAATTGAGTGAACGAACT AAG AG CT C CTGGATGGGTC 
CGGGAACTCGCGTGGGCACAAGGTGCCAAAAGGCAGGCAGCCTGCCCAGGCCCTCCCTCCTG 

^ TCCATCCCCCACATTCGCCTGTCTGTCCTTGTGATTTGGCATAAAGAGCTTCTGTTTTCTTT 
GAAAAAAAAAAAAAAAA 



BNSDOCID: <WO 996308 8 A2 I 



WO 99/63088 




PCT/US99/12252 



FIGURE 122 

MAVYVGMLRLGRLCAGSSGVLGARAALSRSWQEARLQGVRFLSSREVDRMVSTPIGGLSYVQ 
GCTKKHLNSKTVGQCLETTAQRVPEREALWLHEDVRLTFAQLKEEVDKAASGLLSIGLCKG 
DRLGMWG PNS YAWVLMQL ATAQ AG 1 1 LVSVNPAYQAMELE YVLKKVGCKALVFPKQFKTQQY 
YNVLKQICPEVENAQPGALKSQRLPDLTTVISVDAPLPGTLLLDEVVAAGSTRQHLDQLQYN 
QQFLSCHDPINIQFTSGTTGSPKGATLSHYNIVNNSNILGERLKLHEKTPEQLRMILPNPLY 
HCLGS VAGTMMCLMYGATL I LASP I FNGKKALEA I SRERGTFLYGTPTMFVD I LNQPDFSS Y 
DISTMCGGVIAGSPAPPELIRAIINKINMKDLWAYGTTENSPVTFAHFPEDTVEQKAESVG 
RIMPHTEARIMNMEAGTLAKLNTPGELCIRGYCVMLGYWGEPQKTEEAVDQDKWYWTGDVAT 
MNEQGFCKIVGRSKDMIIRGGENIYPAELEDFFHTHPKVQEVQVVGVKDDRMGEEICACIRL 
KDGEETTVEEIKAFCKGKISHFKIPKYIVFVTNYPLTISGKIQKFKLREQMERHLNL 
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FIGURE 123 

CAACTCCAACATTTTAGGAGAGCGCCTGAAACTGCATGAGAAGACACCAGAGCAGTTGCGGA 
TGATCCTGCCCAACCCCCTGTACCATTGCCTGGGTTCCGTGGCAGGCACAATGATGTGTCTG 
ATGTACGGTGCCACCCTCATCCTGGCCTCTCCCATCTTCAATGGCAAGAAGGCACTGGAGGC 
CATCAGCAGAGAGAGAGGCACCTTCCTGTATGGTACCCCCACGATGTTCGTGGACATTCTGA 
ACCAGCCAGACTTCTCCAGTTATGACATCTCGACCATGTGTGGAGGTGTCATTGCTGGGTCC 
C CTGCACCTCCAGAGTTG AT CCGAGC CATC AT CAACAA GAT AAATATGAAGGACCTGGTGGT 
TGCTTATGGAACCACAGAGAACAGTCCCGTGACATTCGCGCACTTCCCTGAGGACACTGTGG 
AGCAGAAGGCAGAAAGCGTGGGCAGAATTATGCCTCACACGGAGGCGCGGATCATGAACATG 
GAGGCAGGGACGCTGGCAAAGCTGAACACGCCCGGGGAGCTGTGCATCCGAGGGTACTGCGT 
CATGCTGGGCTACTGGGGTGAGCCTCAGAAGACAGAGGAAGCAGTGGATCAGGACAAGTGGT 
ATTGGACAGGAGATGTCGCCAC 
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FIGURE 124 

GAGCAGGACGGAGCCATGGACCCCGCCAGGAAAGCAGGTGCCCAGGCCATGATCTGGACTGC 
AGGCTGGCTGCTGCTGCTGCTGCTTCGCGGAGGAGCGCAGGCCCTGGAGTGCTACAGCTGCG 
TGCAGAAAGCAGATGACGGATGCTCCCCGAACAAGATGAAGACAGTGAAGTGCGCGCCGGGC 
GTGGACGTCTGCACCGAGGCCGTGGGGGCGGTGGAGACCATCCACGGACAATTCTCGCTGGC 
AGTGCGGGGTTGCGGTTCGGGACTCCCCGGCAAGAATGACCGCGGCCTGGATCTTCACGGGC 
TTCTGGCGTTCATCCAGCTGCAGCAATGCGCTCAGGATCGCTGCAACGCCAAGCTCAACCTC 
ACCTCGCGGGCGCTCGACCCGGCAGGTAATGAGAGTGCATACCCGCCCAACGGCGTGGAGTG 
CTACAGCTGTGTGGGCCTGAGCCGGGAGGCGTGCCAGGGTACATCGCCGCCGGTCGTGAGCT 
GCTACAACGCCAGCGATCATGTCTACAAGGGCTGCTTCGACGGCAACGTCACCTTGACGGCA 
GCTAATGTGACTGTGTCCTTGCCTGTCCGGGGCTGTGTCCAGGATGAATTCTGCACTCGGGA 
TGGAGTAACAGGCCCAGGGTTCACGCTCAGTGGCTCCTGTTGCCAGGGGTCCCGCTGTAACT 
CTGACCTCCGCAACAAGACCTACTTCTCCCCTCGAATCCCACCCCTTGTCCGGCTGCCCCCT 
CCAGAGCCCACGACTGTGGCCTCAACCACATCTGTCACCACTTCTACCTCGGCCCCAGTGAG 
ACCCACATCCACCACCAAACCCATGCCAGCGCCAACCAGTCAGACTCCGAGACAGGGAGTAG 
AACACGAGGCCTCCCGGGATGAGGAGCCCAGGTTGACTGGAGGCGCCGCTGGCCACCAGGAC 
CGCAGCAATTCAGGGCAGTATCCTGCAAAAGGGGGGCCCCAGCAGCCCCATAATAAAGGCTG 
TGTGGCTCCCACAGCTGGATTGGCAGCCCTTCTGTTGGCCGTGGCTGCTGGTGTCCTACTGT 
GAGCTTCTCCACCTGGAAATTTCCCTCTCACCTACTTCTCTGGCCCTGGGTACCCCTCTTCT 
CATCACTTCCTGTTCCCACCACTGGACTGGGCTGGCCCAGCCCCTGTTTTTCCAACATTCCC 
CAGTATCCCCAGCTTCTGCTGCGCTGGTTTGCGGCTTTGGGAAATAAAATACCGTTGTATAT 
ATTCTGCCAGGGGTGTTCTAGCTTTTTGAGGACAGCTCCTGTATCCTTCTCATCCTTGTCTC 
TCCGCTTGTCCTCTTGTGATGTTAGGACAGAGTGAGAGAAGTCAGCTGTCACGGGGAAGGTG 
AGAGAGAGGATGCTAAGCTTCCTACTCACTTTCTCCTAGCCAGCCTGGACTTTGGAGCGTGG 
GGTGGGTGGGACAATGGCTCCCCACTCTAAGCACTGCCTCCCCTACTCCCCGCATCTTTGGG 
GAATCGGTTCCCCATATGTCTTCCTTACTAGACTGTGAGCTCCTCGAGGGGGGGCCCGGTAC 
CCAATTCGCCCTATAGTGAGTCGTA 
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FIGURE 125 

MD P ARKAGAQ AM I WTAGWLLLLLLRGGAQALE C YS C VQ KADDGC S PNKMKT VKC AP GVDVCT 
EAVGAVETIHGQFSLAVRGCGSGLPGKNDRGLDLHGLLAFIQLQQCAQDRCNAKLNLTSRAL 
DPAGNESAYPPNGVECYSCVGLSREACQGTSPPVVS CYNASDHVYKGCFDGNVTLTAANVTV 
SLiPVRGCVQDEFCTRDGVTGPGFTLSGSCCQGSRCNSDLRNKTYFSPRIPPLVRLPPPEPTT 
VASTTSVTTSTSAPVRPTSTTKPMPAPTSQTPRQGVEHEASRDEEPRLTGGAAGHQDRSNSG 
QYPAKGGPQQPHNKGCVAPTAGLAALLLAVAAGVLL 
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FIGURE 126 

CGGGACTCGGCGGGTCCTCCTGGGAGTCTCGGAGGGGACCGGCTGTGCAGACGCCATGGAGT 
TGGTGCTGGTCTTCCTCTGCAGCCTGCTGGCCCCCATGGTCCTGGCCAGTGCAGCTGAAAAG 
GAGAAGGAAATGGACCCTTTTCATTATGATTACCAGACCCTGAGGATTGGGGGACTGGTGTT 
CGCTGTGGTCCTCTTCTCGGTTGGGATCCTCCTTATCCTAAGTCGCAGGTGCAAGTGCAGTT 
TCAATCAGAAGCCCCGGGCCCCAGGAGATGAGGAAGCCCAGGTGGAGAACCTCATCACCGCC 
AATGCAACAGAGCCCCAGAAGCAGAGAACTGAAGTGCAGCCATCAGGTGGAAGCCTCTGGAA 
CCTGAGGCGGCTGCTTGAACCTTTGGATGCAAATGTCGATGCTTAAGAAAACCGGCCACTTC 
AGCAACAGCCCTTTCC CC AGGAGAAGCCAAGAACTTGTGTGTCCCCCACGCTATCCCCTCTA 
ACACCATTCCTCCACCTGATGATGCAACTAACACTTGCCTCCCCACTGCAGCCTGCGGTCCT 
GCCCACCTCCCGTGATGTGTGTGTGTGTGTGTGTGTGTGACTGTGTGTGTTTGCTAACTGTG 
GTCTTTGTGGCTACTTGTTTGTGGATGGTATTGTGTTTGTTAGTGAACTGTGGACTCGCTTT 
CCCAGGCAGGGGCTGAGCCACATGGCCATCTGCTCCTCCCTGCCCCCGTGGCCCTCCATCAC 
CTTCTGCTCCTAGGAGGCTGCTTGTTGCCCGAGACCAGCCCCCTCCCCTGATTTAGGGATGC 
GT AG GG T AAG AG CACGGGCAGTGGTCTTCAGTCGTCTTGGGACCTGGGAAGGTTTGC AG C AC 
TTTGTCATCATTCTTCATGGACTCCTTTCACTCCTTTAACAAAAACCTTGCTTCCTTATCCC 
ACCTGATCCCAGTCTGAAGGTCTCTTAGCAACTGGAGATACAAAGCAAGGAGCTGGTGAGCC 
CAGCGTTGACGTCAGGCAGGCTATGCCCTTCCGTGGTTAATTTCTTCCCAGGGGCTTCCACG 
AGGAGTCCCCATCTGCCCCGCCCCTTCACAGAGCGCCCGGGGATTCCAGGCCCAGGGCTTCT 
ACTCTGCCCCTGGGGAATGTGTCCCCTGCATATCTTCTCAGCAATAACTCCATGGGCTCTGG 
GACCCTACCCCTTCCAACCTTCCCTGCTTCTGAGACTTCAATCTACAGCCCAGCTCATCCAG 
ATGCAGACTACAGTCCCTGCAATTGGGTCTCTGGCAGGCAATAGTTGAAGGACTCCTGTTCC 
GTTGGGGCCAGCACACCGGGATGGATGGAGGGAGAGCAGAGGCCTTTGCTTCTCTGCCTACG 
TCCCCTTAGATGGGCAGCAGAGGCAACTCCCGCATCCTTTGCTCTGCCTGTCGGTGGTCAGA 
GCGGTGAGCGAGGTGGGTTGGAGACTCAGCAGGCTCCGTGCAGCCCTTGGGAACAGTGAGAG 
GTTGAAGGTCATAACGAGAGTGGGAACTCAACCCAGATCCCGCCCCTCCTGTCCTCTGTGTT 
CCCGCGGAAACCAACCAAACCGTGCGCTGTGACCCATTGCTGTTCTCTGTATCGTGATCTAT 
CCTCAACAACAACAGAAAAAAGGAATAAAATATCCTTTGTTTCCT 
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MELVLVFLCSLLAPMVLASAAEKEKEMDPFHYDYQTLR I GGLVFAWLFS VG ILLI LSRRCK 
CSFNQKPRAPGDEEAQVENLITANATEPQKQRTEVQPSGGSLWNLRRLLEPLDANVDA 
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FIGURE 128 

AAACTTGACGCCATGAAGATCCCGGTCCTTCCTGCCGTGGTGCTCCTCTCCCTCCTGGTGCT 
CCACTCTGCCCAGGGAGCCACCCTGGGTGGTCCTGAGGAAGAAAGCACCATTGAGAATTATG 
CGTCACGACCCGAGGCCTTTAACACCCCGTTCCTGAACATCGACAAATTGCGATCTGCGTTT 
AAGGCTGATGAGTTCCTGAACTGGCACGCCCTCTTTGAGTCTATCAAAAGGAAACTTCCTTT 
1 CCTCAACTGGGATGCCTTTCCTAAGCTGAAAGGACTGAGGAGCGCAACTCCTGATGCCCAGT 



GACCATGACCTCCACTGGAAGAGGGGGCTAGCGTGAGCGCTGATTCTCAACCTACCATAACT 
CTTTCCTGCCTCAGGAACTCCAATAAAACATTTTCCATCCAAA 
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FIGURE 129 

MKIPVLPAWLLSLLVLHSAQGATLGGPEEESTIENYASRPEAFNTPFLNIDKLRSAFKADE 
FLNWHALFESIKRKLPFLNWDAFPKLKGLRSATPDAQ 
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FIGURE 130 
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CAGTTCTGAAATCAATGGAGTTAATTTAGGGAATACAAACCAGCCATGGGGGTGGAGATTGC 
CTTTGCCTCAGTGATTCTCACCTGCCTCTCCCTTCTGGCAGCAGGAGTCTCCCAGGTTGTTC 
TTCTCCAGCCAGTTCCAACTCAGGAGACAGGTCCCAAGGCCATGGGAGATCTCTCCTGTGGC 
TTTG CCGGCC ACT CATGAGAGTGTTTTTGTGTAAAGT ATT TTTT AG AAT A CTGTTGACTTCT 

1 TCATGATTTAATAACCATCCTTTGCGAAGTTTTATGAGGCTTTAGGGGAATGTCAACCCTCA 

1 

AATTTTTGTTATACTAGATGGCTTCCATTTACCCACCACTATTTTAAGGTCCCTTTATTTTT 
AGGTTCAAGGTTCATTTGACTTGAGAAAGTGCCCTTCTGCAGCTTCATTGATTTTGTTTATC 
TTCACTATTAATTGTAACGATTAAAAAAGAATAAGAGCACGCAGACCTCTAGGAGAATATTT 
TATCCCTGGGTGCCCCTGACACATTTATGTAGTGATCCCACAAATGTGATTGTTAATTTAAA 
TGTTATTCTAATATTAGTACATTCAGTTGTGATGTAATATGAATAACCAGAATCTATTTCTT 
AAAAGTTTTGAGTATATTTTTCAACTAGATATTTGTATAGAAAGACTGAATAGTGATG 

1 
3& 
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FIGURE 131 



MGVE I AFASVI LTCLSLLAAGVSQWLLQPVPTQETGPKAMGDLSCGFAGHS 



i 



BNSDOCID: <WO_ 9963088A2. ,.!...> 



WO 99/63088 „ / . PCT/US99/12252 

FIGURE 132 

GGGGAATCTGCAGTAGGTCTGCCGGCGATGGAGTGGTGGGCTAGCTCGCCGCTTCGGCTCTG 
GCTGCTGTTGTTCCTCCTGCCCTCAGCGCAGGGCCGCCAGAAGGAGTCAGGTTCAAAATGGA 
AAGTATTTATTGACCAAATTAACAGGTCTTTGGAGAATTACGAACCATGTTCAAGTCAAAAC 
TGCAGCTGCTACCATGGTGTCATAGAAGAGGATCTAACTCCTTTCCGAGGAGGCATCTCCAG 
GAAGATGATGGCAGAGGTAGTCAGACGGAAGCTAGGGACCCACTATCAGATCACTAAGAACA 
GACTGTACCGGGAAAATGACTGCATGTTCCCCTCAAGGTGTAGTGGTGTTGAGCACTTTATT 
TTGGAAGTGATCGGGCGTCTCCCTGACATGGAGATGGTGATCAATGTACGAGATTATCCTCA 
GGTTCCTAAATGGATGGAGCCTGCCATCCCAGTCTTCTCCTTCAGTAAGACATCAGAGTACC 
ATGATATCATGTATCCTGCTTGGACATTTTGGGAAGGGGGACCTGCTGTTTGGCCAATTTAT 
CCTACAGGTCTTGGACGGTGGGACCTCTTCAGAGAAGATCTGGTAAGGTCAGCAGCACAGTG 
GCCATGGAAAAAGAAAAACTCTACAGCATATTTCCGAGGATCAAGGACAAGTCCAGAACGAG 
ATCCTCTCATTCTTCTGTCTCGGAAAAACCCAAAACTTGTTGATGCAGAATACACCAAAAAC 
CAGGCCTGGAAATCTATGAAAGATACCTTAGGAAAGCCAGCTGCTAAGGATGTCCATCTTGT 
GGATCACTGCAAATACAAGTATCTGTTTAATTTTCGAGGCGTAGCTGCAAGTTTCCGGTTTA 
AACACCTCTTCCTGTGTGGCTCACTTGTTTTCCATGTTGGTGATGAGTGGCTAGAATTCTTC 
TATCCACAGCTGAAGCCATGGGTTCACTATATCCCAGTCAAAACAGATCTCTCCAATGTCCA 
AGAGCTGTTACAATTTGTAAAAGCAAATGATGATGTAGCTCAAGAGATTGCTGAAAGGGGAA 
GCCAGTTTATTAGGAACCATTTGCAGATGGATGACATCACCTGTTACTGGGAGAACCTCTTG 
AGTGAATACTCTAAATTCCTGTCTTATAATGTAACGAGAAGGAAAGGTTATGATCAAATTAT 
TCCCAAAATGTTGAAAACTGAACTATAGTAGTCATCATAGGACCATAGTCCTCTTTGTGGCA 
ACAGATCTCAGATATCCTACGGTGAGAAGCTTACCATAAGCTTGGCTCCTATACCTTGAATA 
TCTGCTATCAAGCCAAATACCTGGTTTTCCTTATCATGCTGCACCCAGAGCAACTCTTGAGA 
AAGATTTAAAATGTCTCTAATACACTGATATGAAGCAGTTCAACTTTTTGGATGAATAAGGA 
CCAGAAATCGTGAGATGTGGATTTTGAACCCAACTCTACCTTTCATTTTCTTAAGACCAATC 
ACAGCTTGTGCCTCAGATCATCCACCTGTGTGAGTCCATCACTGTGAAATTGACTGTGTCCA 
TGTGATGATGCCCTTTGTCCCATTATTTGGAGCAGAAAATTCGTCATTTGGAAGTAGTACAA 
CTCATTGCTGGAATTGTGAAATTATTCAAGGCGTGATCTCTGTCACTTTATTTTAATGTAGG 
AAACCCTATGGGGTTTATGAAAAATACTTGGGGATCATTCTCTGAATGGTCTAAGGAAGCGG 
TAGCCATGCCATGCAATGATGTAGGAGTTCTCTTTTGTAAAACCATAAACTCTGTTACTCAG 
GAGG TTT C T ATAATG C C ACAT AGAAAGAGG C C AATT GC ATGAGTAATTATTGCAATTGGATT 
TCAGGTTCCCTTTTTGTGCCTTCATGCCCTACTTCTTAATGCCTCTCTAAAGCCAAA 
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FIGURE 133 

MEWWASSPLRLWLLLFLLPSAQGRQKESGSKWKVFIDQINRSLENYEPCSSQNCSCYHGVIE 
EDLTPFRGGISRKMMAEWRRKLGTHYQITKNRLYRENDCMFPSRCSGVEHFILEVIGRLPD 
MEMVINVRDYPQVPKWME PAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPI YPTGLGRWDL 
FREDLVRSAAQWPWKKKNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKSMKDT 

■4 LGKPAAKDVHLVDHCKYKYLFNFRGVAASFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVH 

I 

If YIPVKTDLSNVQELLQFVKANDDVAQEIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSY 
NVTRRKGYDQ 1 1 PKMLKTEL 
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FIGURE 134 



CACCCCTCCATTTCTCGCCATGGCCCCTGCACTGCTCCTGATCCCTGCTGCCCTCGCCTCTT 
TCATCCTGGCCTTTGGCACCGGAGTGGAGTTCGTGCGCTTTACCTCCCTTCGGCCACTTCTT 
GGAGGGATCCCGGAGTCTGGTGGTCCGGATGCCCGCCAGGGATGGCTGGCTGCCCTGCAGGA 
CCGCAGCATCCTTGCCCCCCTGGCATGGGATCTGGGGCTCCTGCTTCTATTTGTTGGGCAGC 
ACAGCCTCATGGCAGCTGAAAGAGTGAAGGCATGGACATCCCGGTACTTTGGGGTCCTTCAG 
AGGTCACTGTATGTGGCCTGCACTGCCCTGGCCTTGCAGCTGGTGATGCGGTACTGGGAGCC 
CATACCCAAAGGCCCTGTGTTGTGGGAGGCTCGGGCTGAGCCATGGGCCACCTGGGTGCCGC 
TCCTCTGCTTTGTGCTCCATGTCATCTCCTGGCTCCTCATCTTTAGCATCCTTCTCGTCTTT 
GACTATGCTGAGCTCATGGGCCTCAAACAGGTATACTACCATGTGCTGGGGCTGGGCGAGCC 
TCTGGCCCTGAAGTCTCCCCGGGCTCTCAGACTCTTCTCCCACCTGCGCCACCCAGTGTGTG 
TGGAGCTGCTGACAGTGCTGTGGGTGGTGCCTACCCTGGGCACGGACCGTCTCCTCCTTGCT 
TTCCTCCTTACCCTCTACCTGGGCCTGGCTCACGGGCTTGATCAGCAAGACCTCCGCTACCT 
m CCGGGCCCAGCTACAAAGAAAACTCCACCTGCTCTCTCGGCCCCAGGATGGGGAGGCAGAGT 
M GAGGAGCTCACTCTGGTTACAAGCCCTGTTCTTCCTCTCCCACTGAATTCT7VAATCCTTAAC 
ATCCAGGCCCTGGCTGCTTCATGCCAGAGGCCCAAATCCATGGACTGAAGGAGATGCCCCTT 
CTACTACTTGAGACTTTATTCTCTGGGTCCAGCTCCATACCCTAAATTCTGAGTTTCAGCCA 
CTGAACTCCAAGGTCCACTTCTCACCAGCAAGGAAGAGTGGGGTATGGAAGTCATCTGTCCC 
TTCACTGTTTAGAGCATGACACTCTCCCCCTCAACAGCCTCCTGAGAAGGAAAGGATCTGCC 
CTGACCACTCCCCTGGCACTGTTACTTGCCTCTGCGCCTCAGGGGTCCCCTTCTGCACCGCT 
GGCTTCCACTCCAAGAAGGTGGACCAGGGTCTGCAAGTTCAACGGTCATAGCTGTCCCTCCA 
GGCCCCAACCTTGCCTCACCACTCCCGGCCCTAGTCTCTGCACCTCCTTAGGCCCTGCCTCT 
GGGCTCAGACCCCAACCTAGTCAAGGGGATTCTCCTGCTCTTAACTCGATGACTTGGGGCTC 
CCTGCTCTCCCGAGGAAGATGCTCTGCAGGAAAATAAAAGTCAGCCTTTTTCTAAAAAAAA 
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FIGURE 135 

MAPALLL I PAALiAS F I LAFGTGVEFVRFTS LR PLLGG I PE SGGPDARQGWLAALQDRS I LAP 
LAWDLGLLLLFVGQHSLMAAERVKAWTSRYFGVLQRSLYVACTALALQLVMRYWEPIPKGPV 
LWEARAEPWATWVPLLCFVLHVISWLLIFSILLVFDYAELMGLKQVYYHVLGLGEPLALKSP 
RALRLFSHLRHPVCVELLTVLWWPTLGTDRLLLAFLLTLYLGLAHGLDQQDLRYLRAQLQR 
KLHLL.SRPQDGEAE 
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CCGAGCACAGGAGATTGCCTGCGTTTAGGAGGTGGCTGCGTTGTGGGAAAAGCTATCAAGGA 
AGAAATTGCCAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGT 
TTTAATTAAGCATGGAATACAGAAAAC^ 71 ^AAAAAACTTAAGCTTTAATTTCATCTGGAATT 
CCACAGTTTTCTTAGCTCCCTGGACCCGGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTA 
TCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTAAAGAACCTTCGGCTCGCGTGCTTCTG 
AGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTAGGATGTCACTGAGATCC 
CTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTACCTCAGCCT 
TCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTT 
ACAGACAAGACTTTCACTTCACACTTCGAGAGCATTCAAACTGCTCTCATCAAAATCCATTT 
CTGGTCATTCTGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTAC 
TTGGGGTGAAAAAAAGTCTTGGTGGGGA x ATGAGGTTCTTACATTTTTCTTATTAGGCCAAG 
AGGCTGAAAAGGAAGACAAAATGTTGGCATTGTCCTTAGAGGATGAACACCTTCTTTATGGT 
GACATAATCCGACAAGATTTTTTAGACACATATAATAACCTGACCTTGAAAACCATTATGGC 
ATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATGAAGACAGACACTGATG 
TTTTCATCAATACTGGCAATTTAGTGAAGTATCTTTTAAACCTAAACCACTCAGAGAAGTTT 
TTCACAGGTTATCCTCTAATTGATAATTATTCCTATAGAGGATTTTACCAAAAAACCCATAT 
TTCTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAA 
TGTCCAGAGATTTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTT 
GAAGATGTTTATGTCGGGATCTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGA 
CACAAATCTTTTCTTTCTATATAGAATCCATTTGGATGTCTGTCAACTGAGACGTGTGATTG 
CAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTTTGGCAGGTCATGCTAAGGAACACC 
ACATGCCATTATTAACTTCACATTCTACAAAAAGCCTAGAAGGACAGGATACCTTGTGGAAA 
GTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTTACACTG 
AACTGAAACTCATGAAAAACCCAGACTGGAGACTGGAGGGTTACACTTGTGATTTATTAGTC 
AGGCCCTTCAAAGATGATATGTGGAGGAATTAAATATAAAGGAATTGGAGGTTTTTGCTAAA 
GAAATTAATAGGACCAAACAATTTGGACATGTCATTCTGTAGACTAGAATTTCTTAAAAGGG 
TGTTACTGAGTTATAAGCTCACTAGGCTGTAAAAACAAAACAATGTAGAGTTTTATTTATTG 
AACAATGTAGTCACTTGAAGGTTTTGTGTATATCTTATGTGGATTACCAATTTAAAAATATA 
TGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATACTGAACAAAATTTTACCTGTTTT 
TGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATTATTATTTAAAATTA 
CTTCAACTTTGTGTTTTTAAATGTTTTGACGATTTCAATACAAGATAAAAAGGATAGTGAAT 
CATTCTTTACATGCAAACATTTTCCAGTTACTTAACTGATCAGTTTATTATTGATACATCAC 
TCCATTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAAT 
ATTTTACTGTGGTAATATAGAGAAGAATTAAAGCAAGAAAATCTGAAAA 
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FIGURE 137 

MASALWTVLPSRMSLRSLKWSLLLLSLLSFFVMWYLSLPHYNVIERVNWMYFYEYEPIYRQD 
FHFTLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEK 
EDKMLALSLEDEHLLYGD 1 1 RQDFLDTYNNLTLKT I MAFRWVTEFCPNAKYVMKTDTDVF IN 
TGNLVKYLLNLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRD 
LVPR I YEMMGHVKP I KFEDVYVG I CLNLLKVN IH I P EDTNLF FLYR I HLDVCQLRRVI AAHG 
FSSKEI ITFWQVMLRNTTCHY 
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FIGURE 138 



Si 



CCTCTGTCCACTGCTTTCGTGAAGACAAGATGAAGTTCACAATTGTCTTTGCTGGACTTCTT 
GGAGTCTTTCTAGCTCCTGCCCTAGCTAACTATAATATCAACGTCAATGATGACAACAACAA 
TGCTGGAAGTGGGCAGCAGTCAGTGAGTGTCAACAATGAACACAATGTGGCCAATGTTGACA 
ATAACAACGGATGGGACTCCTGGAATTCCATCTGGGATTATGGAAATGGCTTTGCTGCAACC 
AGACTCTTTCAAAAGAAGACATGCATTGTGCACAAAATGAACAAGGAAGTCATGCCCTCCAT 
TCAATCCCTTGATGCACTGGTCAAGGAAAAGAAGCTTCAGGGT7VAGGGACCAGGAGGACCAC 
CTCCCAAGGGCCTGATGTACTCAGTCAACCCAAACAAAGTCGATGACCTGAGCAAGTTCGGA 
AAAAACATTGCAAACATGTGTCGTGGGATTCCAACATACATGGCTGAGGAGATGCAAGAGGC 
AAGCCTGTTTTTTTACTCAGGAACGTGCTACACGACCAGTGTACTATGGATTGTGGACATTT 
CCTTCTGTGGAGACACGGTGGAGAACTAAACAATTTTTTAAAGCCACTATGGATTTAGTCAT 
CTGAATATGCTGTGCAGAAAAAATATGGGCTCCAGTGGTTTTTACCATGTCATTCTGAAATT 
TTTCTCTACTAGTTATGTTTGATTTCTTTAAGTTTCAATAAAATCATTTAGCATTGAAAAAAA 
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FIGURE 139 



MKFTIVFAGLLGVFLAPALANYNIISTW 

IWDYGNGFAATRLFQKKTCIVHKMNKEVMPSIQSLDALVKEKKLQGKGPGGPPPKGLMYSVN 
PNKVDDLSKFGKNIANMCRGIPTYMAEEMQEASLFFYSGTCYTTSVLWIVDISFCGDTVEN 



1 



BNSDOCiU: <WO „.„9963088A2 J_> 



WO 99/63088 , PCT/US99/12252 

FIGURE 140 

CATTTCTGAAACTAATCGTGTCAGAATTGACTTTGAAAAGCATTGCTTTTTACAGAAGTATA 

TTAACTTTTTAGGAGTAATTTCTAGTTTGGATTGTAATATGAAATAATTTAAAAGGGCTTCG 

CTCATATATAGGAAAATCGCATATGGTCCTAGTATTAAATTCTTATTGCTTACTGATTTTTT 

TGAGTTAAGAGTTGTTATATGCTAGAATATGAGGATGTGAATATAAATAAGAGAAGAAAAAA 

GAATT^lAAGTAGATTGAGTCTCCAATTTTATGTAAGCTTCAGAAGAACTGGTTTGTTTACATG 

CAAGCTTATAGTTGAAATATTTTTCAGGAATTACATGAATGACAGTCTTCGAACCAATGTGT 

TTGTTCGATTTCAACCAGAGACTATAGCATGTGCTTGCATCTACCTTGCAGCTAGAGCACTT 

CAGATTCCGTTGCCAACTCGTCCCCATTGGTTTCTTCTTTTTGGTACTACAGAAGAGGAAAT 

CCAGGAAATCTGCATAGAAACACTTAGGCTTTATACCAGAAAAAAGCCAAACTATGAATTAC 

TGGAAAAAGAAGTAGAAAAAAGAAAAGTAGCCTTACAAGAAGCCAAATTAAAAGCAAAGGGA 

TTGAATCCGGATGGAACTCCAGCCCTTTCAACCCTGGGTGGATTTTCTCCAGCCTCCAAGCC 

AT CATC AC CAAG AGAAGTAAAAGCTGAAGAGAAATC AC C AAT CT CC ATTAATGTGAAGAC AG 

TCAAAAAAGAACCTGAGGATAGACAACAGGCTTCCAAAAGCCCTTACAATGGTGTAAGAAAA 

GACAGCAAGAGAAGTAGAAATAGC AGAAGTG C AAGT CG AT CG AGGT CAAG AACACGAT CACG 

TTCTAGATCACATACTCCAAGAAGACACTATAATAATAGGCGGAGTCGATCTGGAACATACA 

GCTCGAGATCAAGAAGCAGGTCCCGCAGTCACAGTGAAAGCCCTCGAAGACATCATAATCAT 

GGTTCTCCTCACCTTAAGGCCAAGCATACCAGAGATGATTTAAAAAGTTCAAACAGACATGG 

TCATAAAAGGAAAAAATCTCGTTCTCGATCTCAGAGCAAGTCTCGGGATCACTCAGATGCAG 

CCAAGAAACACAGGCATGAAAGGGGACATCATAGGGACAGGCGTGAACGATCTCGCTCCTTT 

GAGAGGTCCCATAAAAGCAAGCACCATGGTGGCAGTCGCTCAGGACATGGCAGGCACAGGCG 

CTGACTTTCTCTTCCTTTGAGCCTGCATCAGTTCTTGGTTTTGCCTATCTACAGTGTGATGT 

AT GG AC TC AAT CAAAAACATTAAACGCAAACTGATTAGGATTTGATTTCTTGAAACCCTCTA 

GGTCTCTAGAACACTGAGGACAGTTTCT^TTGAAAAGAACTATGTTAATTTTTTTGCACATT 

AAAATGCCCTAGCAGTATCTAATTAAAAACCATGGTCAGGTTCAATTGTACTTTATTATAGT 

TGTGTATTGTTTATTGCTATAAGAACTGGAGCGTGAATTCTGTAAAAATGTATCTTATTTTT 

ATACAGATAAAATTGCAGACACTGTTCTATTTAAGTGGTTATTTGTTTAAATGATGGTGAAT 

ACTTTCTTAACACTGGTTTGTCTGCATGTGTAAAGATTTTTACAAGGAAATAAAATACAAAT 

CTTGTTTTTTCTAAAAAAAAAAAAAAAAAAGT 
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FIGURE 141 

MNDSLRTNVFVRFQPETIACACIYLAARALQIPLPTRPHWFLLFGTTEEEIQEICIETLRLY 
TRKKPNYELLEKEVEKRKVALQEAKLKAKGLNPDGTPALSTLGGFSPASKPSSPREVKAEEK 
SPISINVKTVKKEPEDRQQASKSPYNGVRKDSKRSRNSRSASRSRSRTRSRSRSHTPRRHYN 
NRRSRSGTYS SRSRSRSRSHSES PRRHHNHGS PHLKAKHTRDDLKS SNRHGHKRKKSRSRSQ 
SKSRDHSDAAKKHRHERGHHRDRRERSRSFERSHKSKHHGGSRSGHGRHRR 
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FIGURE 142 



TGGGGATAAAGGAAAAATGGTCAGGTATTAATGGCTTAAAGATTATTGGAAGGGGTTTATCA 
TTTTTTGAANNTATTCGGGTCANAATTGNCTTTGAAAAGCATTGCTTTTTACAGAAATATAT 
TANCTTTTTAGAGTAATTTCTAGTTTGGATTGTAATATGAAATTATTTAAAAGGGCTTCGCT 
CATATATAGGAAAATCGCATATGGTCCTAGTATTAAATTNTTATTGCTTACTGATTTTTTTG 
; | AGTTAAGAGTTGTTATATGNTAGAATATGAGGATGTGAATATAAATAAGAGAAGAAAAAAGA 
I ATAAAGTAGATTGAGTCTCCAATTTTATGTAAGCTTCAGAAGAACTGGTTTGTTTACATGCA 
AGCTTATAGTTGAAATATTTTTCAGGAATTACATGAATGACAGTCTTCGAACCAATGTGTTT 
4 f GTTCGATTTCAACCAGAGANTATAGCATGTGCTTGCATCTACCTTGCAGNTAGAGCACTTCA 
GATTCCGTTGCCAACTNGTCCCCATTGGTTTCTTCTTTTTGGTACTACAGAAGAGGAAATCC 
AGGAAATNTGCATAGAAACACTTAGGCTTTATACCAGAAAAAAGCCAAACTATGAATTACTG 
GAAAAAGAAGTAGAAAAAAGAAAAGT AGCCTT ACAAGAAG C CNAATTAAAAG C AAAGGGATT 
GAATCCGGATGGAACTCCAGCCCTTTCAACCCTGGGTGGATTTTCTCC 

1 
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FIGURE 143 

GGCACGAGGCCTCGTGCCAAGCTTGGCACGAGGGTGCACCGCGTTCTCGCACGCGTCATGGC 
GGTCCTCGGAGTACAGCTGGTGGTGACCCTGCTCACTGCCACCCTCATGCACAGGCTGGCGC 
CACACTGCTCCTTCGCGCGCTGGCTGCTCTGTAACGGCAGTTTGTTCCGATACAAGCACCCG 

^ TCTGAGGAGGAGCTTCGGGCCCTGGCGGGGAAGCCGAGGCCCAGAGGCAGGAAAGAGCGGTG 

•i GGCCAATGGCCTTAGTGAGGAGAAGCCACTGTCTGTGCCCCGAGATGCCCCGTTCCAGCTGG 

I AGACCTGCCCCCTCACGACCGTGGATGCCCTGGTCCTGCGCTTCTTCCTGGAGTACCAGTGG 
TTTGTGGACTTTGCTGTGTACTCGGGCGGCGTGTACCTCTTCACAGAGGCCTACTACTACAT 

y ; ; GCTGGG^CCAGCCAAGGAGACTAACATTGCTGTGTTCTGGTGCCTGCTCACGGTGACCTTCT 
CCATCAAGATGTTCCTGACAGTGACACGGCTGTACTTCAGCGCCGAGGAGGGGGGTGAGCGC 
TCTGTCTGCCTCACCTTTGCCTTCCTCTTCCTGCTGCTGGCCATGCTGGTGCAAGTGGTGCG 
GGAGGAGACCCTCGAGCTGGGCCTGGAGCCTGGTCTGGCCAGCATGACCCAGAACTTAGAGC 
CACTTCTGAAGAAGCAGGGCTGGGACTGGGCGCTTCCTGTGGCCAAGCTGGCTATCCGCGTG 
GGACTGGCAGTGGTGGGCTCTGTGCTGGGTGCCTTCCTCACCTTCCCAGGCCTGCGGCTGGC 

J CCAGACCCACCGGGACGCACTGACCATGTCGGAGGACAGACCCATGCTGCAGTTCCTCCTGC 
ACACCAGCTTCCTGTCTCCCCTGTTCATCCTGTGGCTCTGGACAAAGCCCATTGCACGGGAC 
TTCCTGCACCAGCCGCCGTTTGGGGAGACGCGTTTCTCCCTGCTGTCCGATTCTGCCTTCGA 
CTCTGGGCGCCTCTGGTTGCTGGTGGTGCTGTGCCTGCTGCGGCTGGCGGTGACCCGGCCCC 
ACCTGCAGGCCTACCTGTGCCTGGCCAAGGCCCGGGTGGAGCAGCTGCGAAGGGAGGCTGGC 
CGCATCGAAGCCCGTGAAATCCAGCAGAGGGTGGTCCGAGTCTACTGCTATGTGACGGTGGT 
GAGCTTGCAGTACCTGACGCCGCTCATCCTCACCCTCAACTGCACACTTCTGCTCAAGACGC 
TGGGAGGCTATTCCTGGGGCCTGGGCCCAGCTCCTCTACTATCCCCCGACCCATCCTCAGCC 
AGCGCTGCCCCCATCGGCTCTGGGGAGGACGAAGTGCAGCAGACTGCAGCGCGGATTGCCGG 
GGGCCTGGGTGGCCTGCTTACTCCCCTCTTCCTCCGTGGGGTCCTGGCCTACCTCATCTGGT 

^ GGACGGCTGCCTGCCAGCTGCTCGCCAGCCTTTTCGGCCTCTACTTCCACCAGCACTTGGCA 
GGCTCCTAGCTGCCTGCAGACCCTCCTGGGGCCCTGAGGTCTGTTCCTGGGGCAGCGGGACA 

: % CTAGCCTGCCCCCTCTGTTTGCGCCCCCGTGTCCCCAGCTGCAAGGTGGGGCCGGACTCCCC 
GGCGTTCCCTTCACCACAGTGCCTGACCCGCGGCCCCCCTTGGACGCCGAGTTTCTGCCTCA 
GAACTGTCTCTCCTGGGCCCAGCAGCATGAGGGTCCCGAGGCCATTGTCTCCGAAGCGTATG 
TGCCAGGTTTGAGTGGCGAGGGTGATGCTGGCTGCTCTTCTGAACAAATAAAGGAGCATGCC 
GATTTTTAA 
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FIGURE 144 



MAVLGVQLWTLLTATLMHRLAPHCSFARWLLClSrGSLFRYKHPSEEELRALAGKPRPRGRKE 
RWANGLSEEKPLSVPRDAPFQLETCPLTTVDALVLRFFLEYQWFVDFAVYSGGVYLFTEAYY 
YMLGPAKETNI AVFWCLLTVTFS I KMFLTVTRLYFS AEEGGERS VCLTFAFLFLLLAMLVQV 
VREETLELGLEPGLASMTQNLEPLLKKQGWDWALPVAKLAIRVGLAWGSVLGAFLTFPGLR 
LAQTHRDALTMSEDRPMLQFLLHTSFLSPLFILWLWTKPIARDFLHQPPFGETRFSLLSDSA 
FDSGRLWLLWLCLLRLAVTRPHLQAYLCLAKARVEQLRREAGRIEAREIQQRAA^RVYCYVT 
WSLQYLTPLILTLNCTLLLKTLGGYSWGLGPAPLLSPDPSSASAAPIGSGEDEVQQTAARI 
AGALGGLLTPLFLRGVLAYLIWWTAACQLLASLFGLYFHQHLAGS 
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FIGURE 145 



CGTTNGCACGCGTCAATGGCGGTCCTCGGAGTACAGCTGGTGGTGACCCTGCTCACTGCCAC 
CCTCATGCACAGGCTGGCGCCACACTGCTCCTTCGCGCGCTGGCTGCTCTGTAACGGCAGTT 
TGTTCCGATACAAGCACCCGTNTTGAGGAGGAGCTTCGGGCCCTGGCGGGGAAGCCGAGGCC 
CAGAGGCAGGAAAGAGCGGTGGGCCAATGGCCTTAGTGAGGAGAAGCCACTGTCTGTGCCCC 
GAGATGCCCCGTTCCAGCTGGAGACCTGCCCCCTCACGACCGTGGATGCCCTGGTCCTGCGC 
TTCTTCCTGGAGTACCAGTGGTTTGTGGACTTTGCTGTGTACTCGGGCGGCGTGTACCTCTT 
CACAGAGGCCTACTACTACATGCTGGGACCAGCCAAGGAGACTAACATTGCTGTGTTCTGGT 
GCCTGCTCACAGTGACCTTCTCCATCAAGATGTTCCTGACAGTGACACGGCTGTACTTCAGC 
GCCGAGGAGGGGGGTGAGCGCTCTGTCTGCCTCACCTTTGCCTTCCTCTTCCTGCTGCTGGC 
CATGCTGGTGCAAGCG 
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FIGURE 146 

GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAATCTTCTTACGGGCCCGTGATTTA 
TTAACGTGGCTTAATCTGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGC 
ATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGGTTTGGGCCCTTGTAGCTGACAGAAGGTGGC 
CAGGGAGAATGCAGCACACTGCTCGGAGAATGAAGGCGCTTCTGTTGCTGGTCTTGCCTTGG 

^ CTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTG 
TAAAGGTGCCTCCCACTACGGCCTGACCAAAGATAGGAAGAGGCGCTCACAAGATGGCTGTC 
CAGACGGCTGTGCGAGCCTCACAGCCACGGCTCCCTCCCCAGAGGTTTCTGCAGCTGCCACC 

| ATCTCCTTAATGACAGACGAGCCTGGCCTAGACAACCCTGCCTACGTGTCCTCGGCAGAGGA 
CGGGCAGCCAGCAATCAGCCCAGTGGACTCTGGCCGGAGCAACCGAACTAGGGCACGGCCCT 
TTGAGAGATCCACTATTAGAAGCAGATCATTTAAAAAAATAAATCGAGCTTTGAGTGTTCTT 

. M CGAAGGACAAAGAGCGGGAGTGCAGTTGCCAACCATGCCGACCAGGGGAGGGAAAATTCTGA 
v : AAACACCACTGCCCCTGAAGTCTTTCCAAGGTTGTACCACCTGATTCCAGATGGTGAAATTA 
CCAGCATCAAGATCAATCGAGTAGATCCCAGTGAAAGCCTCTCTATTAGGCTGGTGGGAGGT 
AGCGAAACCCCACTGGTCCATATCATTATCCAACACATTTATCGTGATGGGGTGATCGCCAG 
AGACGGCCGGCTACTGCCAGGAGACATCATTCTAAAGGTCAACGGGATGGACATCAGCAATG 
TCCCTCACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTG 
ATGCGTGAACAGAAGTTCCGCAGCAGGAACAATGGACAGGCCCCGGATGCCTACAGACCCCG 
AGATGACAGCTTTCATGTGATTCTCAACAAAAGTAGCCCCGAGGAGCAGCTTGGAATAAAAC 

^ TGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCGGTGTGGCA 

1 TATCGACATGGTCAGCTTGAGGAGAATGACCGTGTGTTAGCCATCAATGGACATGATCTTCG 

^ ATATGGCAGCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCG 
TCGTGTCCCGCCAGGTTCGGCAGCGGAGCCCTGACATCTTTCAGGAAGCCGGCTGGAACAGC 
AATGGCAGCTGGTCCCCAGGGCCAGGGGAGAGGAGCAACACTCCCAAGCCCCTCCATCCTAC 
AATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCGGCATGA 
CCGTCGCAGGGGGAGCATCACATAGAGAATGGGATTTGCCTATCTATGTCATCAGTGTTGAG 
CCCGGAGGAGTCATAAGCAGAGATGGAAGAATAAAAACAGGTGACATTTTGTTGAATGTGGA 
TGGGGTCGAACTGACAGAGGTCAGCCGGAGTGAGGCAGTGGCATTATTGAAAAGAACATCAT 
CCTCGATAGTACTCAAAGCTTTGGAAGTCAAAGAGTATGAGCCCCAGGAAGACTGCAGCAGC 
CCAGCAGCCCTGGACTCCAACCACAACATGGCCCCACCCAGTGACTGGTCCCCATCCTGGGT 
CATGTGGCTGGAATTACCACGGTGCTTGTATAACTGTAAAGATATTGTATTACGAAGAAACA 
CAGCTGGAAGTCTGGGCTTCTGCATTGTAGGAGGTTATGAAGAATACAATGGAAACAAACCT 
TTTTTCATCAAATCCATTGTTGAAGGAACACCAGCATACAATGATGGAAGAATTAGATGTGG 
TGATATTCTTCTTGCTGTCAATGGTAGAAGTACATCAGGAATGATACATGCTTGCTTGGCAA 
GACTGCTGAAAGAACTTAAAGGAAGAATTACTCTAACTATTGTTTCTTGGCCTGGCACTTTT 

^ TTATAGAATCAATGATGGGTCAGAGGAAAACAGAAAAATCACAAATAGGCTAAGAAGTTGAA 
ACACTATATTTATCTTGTCAGTTTTTATATTTAAAGAAAGAATACATTGTAAAAATGTCAGG 
AAAAGTATGATCAT CTAATGAAAG C C AGTT AC AC CT C AGAAAAT ATGATT CCAAAAAAATT A 

./| AAACTACTAGTTTTTTTTCAGTGTGGAGGATTTCTCATTACTCTACAACATTGTTTATATTT 
TTTCTATTCAATAAAAAGCCCTAAAACAACTAAAATGATTGATTTGTATACCCCACTGAATT 
CAAGCTGATTTAAATTTAAAATTTGGTATATGCTGAAGTCTGCCAAGGGTACATTATGGCCA 
TTTTTAATTTACAGCTAAAATATTTTTTAAAATGCATTGCTGAGAAACGTTGCTTTCATCAA 
ACAAGAAT AAAT ATTT TT CAGAAG T T AAA 
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FIGURE 147 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTAT 
APS PEVSAAAT I SLMTDEPGLDNPAYVS SAEDGQ PA I S P VDS GRSNRTRARP FERST I RSRS 
FKKINRALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHL I PDGEI TS IKINRVDP 
SESLSIRLVGGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLL 
RQPCQVLWLTVMREQKFRSRNNGQAPDAYRPRDDSFH^^ILNKSSPEEQLGIKLVRKVDEPGV 
FIFNVLDGGVAYRHGQLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRS 
PDIFQEAGWNSNGSWSPGPGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHRE 
WDLPIYVISVEPGGVISRDGRIKTGDI_,NYDGVELTEVSRSEAVALLKRTSSSIVLKALEV 
KEYEPQEDCSSPAALDSNHNMAPPSDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIV 
GGYEEYNGNKPFFIKSIVEGTPAYNDGRIkCGDILLAVNGRSTSGMIHACLARLLKELKGRI 
TLTIVSVJPGTFL 
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FIGURE 148 

C C AAAGTGATCATTTGAAAAAGAGATAT CC ACAT CTTC AAGC CC AT AT AAAGGATAGAAGCT 
GCACAGGGCAGCTTTACTTACTCCAGCACCTTCCTCTCCCAGGCAAATGGTGCTGACCATCT 
TTGGGATACAATCTCATGGATACGAGGTTTTTAACATCATCAGCCCAAGCAACAATGGTGGC 
AATGTTCAGGAGACAGTGACAATTGATAATGAAAAAAATACCGCCATCGTTAACATCCATGC 
AGGATCATGCTCTTCTACCACAATTTTTGACTATAAACATGGCTACATTGCATCCAGGGTGC 
TCTCCCGAAGAGCCTGCTTTATCCTGAAGATGGACCATCAGAACATCCCTCCTCTGAACAAT 
CTCCAATGGTACATCTATGAGAAACAGGCTCTGGACAACATGTTCTCCAACAAATACACCTG 
GGTCAAGTACAACCCTCTGGAGTCTCTGATCAAAGACGTGGATTGGTTCCTGCTTGGGTCAC 
CCATTGAGAAACTCTGCAAACATATCCCTTTGTATAAGGGGGAAGTGGTTGAAAACACACAT 
AATGTCGGTGCTGGAGGCTGTGCAAAGGCTGGGCTCCTGGGCATCTTGGGAATTTCAATCTG 
TG CAGA C ATT C ATG TT TAG G ATG ATT AG CCCTCTTGTTTT AT CTTTT C AAAGAAAT AC AT CC 
TTGGTTTACACTCAAAAGTCAAATTAAATTCTTTCCCAATGCCCCAACTAATTTTGAGATTC 
AGTCAGAAAATATAAATGCTGTATTTATA 
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FIGURK 149 

MKILVAFLWLTIFGIQSHGYEVFNIISPSNNGGIsT^QETVTIDNEKNTAIVNIHAGSCSSTT 
IFDYKHGYIASRVLSRI^CFILKMDHQNIPPLNNLQWYiyEKQALDNMFSNKYTWVKYNPLE 
Shi KDVDWFLLGSP I EKLCKH I PLYKGE WENTHNVGAGGCAKAGLLG I LG I S I CAD IHV 
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FIGURE 150 



GGCACGAGCCAGGAACTAGGAGGTTCTCACTGCCCGAGCAGAGGCCCTACACCCACCGAGGC 
ATGGGGCTCCCTGGGCTGTTCTGCTTGGCCGTGCTGGCTGCCAGCAGCTTCTCCAAGGCACG 
GGAGGAAGAAATTACCCCTGTGGTCTCCATTGCCTACAAAGTCCTGGAAGTTTTCCCCAAAG 
GCCGCTGGGTGCTCATAACCTGCTGTGCACCCCAGCCACCACCGCCCATCACCTATTCCCTC 
TGTGGAACCAAGAACATCAAGGTGGCCAAGAAGGTGGTGAAGACCCACGAGCCGGCCTCCTT 
CAACCTCAACGTCACACTCAAGTCCAGTCCAGACCTGCTCACCTACTTCTGCCGGGCGTCCT 
CCACCTCAGGTGCCCATGTGGACAGTGCCAGGCTACAGATGCACTGGGAGCTGTGGTCCAAG 
CCAGTGTCTGAGCTGCGGGCCAACTTCACTCTGCAGGACAGAGGGGCAGGCCCCAGGGTGGA 
GATGATCTGCCAGGCGTCCTCGGGCAGCCCACCTATCACCAACAGCCTGATCGGGAAGGATG 
GGCAGGTCCACCTGCAGCAGAGACCATGCCACAGGCAGCCTGCCAACTTCTCCTTCCTGCCG 
AGCCAGACATCGGAGTGGTTCTGGTGCCAGGCTGCAAACAACGCCAATGTCCAGCACAGCGC 
CCTCACAGTGGTGCCCCCAGGTGGTGACCAGAAGATGGAGGACTGGCAGGGTCCCCTGGAGA 
GCCCCATCCTTGCCTTGCCGCTCTACAGGAGCACCCGCCGTCTGAGTGAAGAGGAGTTTGGG 
GGGTTCAGGATAGGGAATGGGGAGGTCAGAGGACGCAAAGCAGCAGCCATGTAGAATGAACC 
GTCCAGAGAGCCAAGCACGGCAGAGGACTGCAGGCCATCAGCGTGCACTGTTCGTATTTGGA 
GTTCATGCAAAATGAGTGTGTTTTAGCTGCTCTTGCCACAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 151 

MGLPGLFCLAVLAASSFSKAREEE ITPVVS I AYKVLEVFPKGRWVL I TCCAPQPPPP I TYSL 
CGTKNIKVAKKWKTHEPASFNLWTLKSSPDLLTYFCRASSTSGAHVDSARLQMHWELWSK 
PVSELRANFTLQDRGAGPRVEM I CQASSGS PPITNSLI GKDGQVHLQQRPCHRQPANFSFLP 
SQTSDWFWCQAANNANVQHSALTVVPPGGDQKMEDWQGPLESPILALPLYRSTRRLSEEEFG 
GFR I GNGEVRGRKAAAM 
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FIGURE 152 

GGTCCTTA ATG GCAGCAGCGGCCGCTACCAAGATCCTTCTGTGCCTCCCGCTTCTGCTCCTG 
CTGTCCGGCTGGTCCCGGGCTGGGCGAGCCGACCCTCACTCTCTTTGCTATGACATCACCGT 
CATCCCTAAGTTCAGACCTGGACCACGGTGGTGTGCGGTTCAAGGCCAGGTGGATGAAAAGA 
CTTTTCTTCACTATGACTGTGGCAACAAGACAGTCACACCTGTCAGTCCCCTGGGGAAGAAA 
CTAAATGTCACAACGGCCTGGAAAGCACAGAACC CAGT ^VCTG AGAGAGGTGGTGGACATACT 
TACAGAGCAACTGCGTGACATTCAGCTGGAGAATTACAGACCCAAGGAACCCCTCACCCTGC 
AGGCAAGGATGTCTTGTGAGCAGAAAGCTGAAGGACACAGCAGTGGATCTTGGCAGTTCAGT 
TTCGATGGGCAGATCTTCCTCCTCTTTb^CTCAGAGAAGAGAATGTGGACAACGGTTCATCC 
TGGAGCCAGAAAGATGAAAGAAAAGTGGGAGAATGACAAGGTTGTGGCCATGTCCTTCCATT 
ACTTCTGAATGGGAGACTGTATAGGATGGCTTGAGGACTTCTTGATGGGCATGGACAGCACC 
CTGGAGCCAAGTGCAGGAGCACCACTCGCGATGTCCTCAGGCACAACCCAACTCAGGGCCAC 
AGCCACCACCCTCATCCTTTGCTGCCTCCTCATCATCCTCCCCTGCTTCATCCTCCCTGGCA 
TCTGAGGAGAGTCCTTTAGAGTGACAGGTTAAAGCTGATACGAAAAGGCTCCTGTGAGCACG 
GTCTTGATCAAACTCGCGCTTCTGTCTGGCCAGCTGCCCACGACCTACGGTGTATGTCCAGT 
GGCCTCCAGCAGATCATGATGACATCATGGACCCAATAGCTCATTCACTGCCTTGATTCCTT 
TTGCCAACAATTTTACCAGCAGTTATACCTAACATATTATGCAATTTTCTCTTGGTGCTACC 
TGATGGAATTCCTGCACTTAAAGTTCTGGCTGACTAAACAAGATATATCATTTTCTTTCTTC 
TCTTTTTGTTTGGAAAATCAAGTACTTCTTTGAATGATGATCTCTTTCTTGCAAATGATATT 
GTCAGTAAAATAATCACGTTAGACTTCAGACCTCTGGGGATTCTTTCCGTGTCCTGAAAGAG 
AATTTTTAAATTATTTAATAAGAAAAAATTTATATTAATGATTGTTTCCTTTAGTAATTTAT 
TGTTCTGTACTGATATTTAAATAAAGAGTTCTATTTCCCAAAAAAAAAAAAAAAAAA 
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FIGURE 153 



MAAAAATKILLCLPLLLLLSGWSRAGRADPHSLCYD ITVI PKFRPGPRWCAVQGQVDEKTFL 
HYDCGNKTVTPVSPLGKKLNVTTAWKAQNPVLREWDILTEQLRDIQLENYTPKEPLTLQAR 
MSCEQKAEGHSSGSWQFSFDGQIFLLFDSEKRMWTTVHPGARKMKEKWENDKWAMSFHYFS 
MGDCIGWLEDFLMGMDSTLEPSAGAPLAMSSGTTQLRATATTLILCCLLI ILPCFILPGI 



0 
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FIGURE 154 

GGGAAAGCCATTTCGAAAACCCATCTATACAAACTATATATTTTCATTTCTGCTGCTAGCTG 
CCTTGGGCCTCACAATTTTCATTCTGTTTTCTGACTTTCAAGTTATATACCGTGGAATGGAG 
TTGATCCCAACCATAACATCGTGGAGGGTTTTAATTTTGGTGGTAGCCCTCACCCAATTCTG 
GTGTGGCTTTCTTTGCAGAGGATTCCACCTTCAAAATCATGAACTCTGGCTGTTGATCAAAA 
!$ GAGAATTTGGATTCTACTCTAAAAGTCAATATAGGACTTGGCAAAAGAAGCTAGCAGAAGAC 
i TCAACCTGGCCTCCCATAAACAGGACAGATTATTCAGGTGATGGCAAAAATGGATTCTACAT 
CAACGGAGGCTATGAAAGCCATGAACAGATTCCAAAAAGAAAACTCAAATTGGGAGGCCAAC 
S CCACAGAACAGCATTTCTGGGCCAGGCTGTAATCAGAATTGTCGTCGTACATGCTCAACAGC 
; ATTGCTTTTTTCCCCAAAATTAACACATTGTGGAGAAGTGATGATACTCTCCCCTTACCTTT 
i CCTCTCTCCATTCAAGCATTCAAAGTATATTTTCAATGAATTAAACCTTGCAGCAAGGGACC 
TTAGATAGGCTTATTCTGACTGTATGCTTTACCAATGAGAGAAAAAAATGCATTTCCTGTAT 
CATC CTTTTCAATAAACTGT ATT CATTTTGAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 155 

MEL I PT I TSWRVL I L WALTQFWCGFLCRG FHLQNHE LWLL I KREFGFYS KS QYRTWQKKLA 
EDSTWPPINRTDYSGDGKNGFYINGGYESHEQIPKRKLKLGGQPTEQHFWARL 
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FIGURE 156 

GTTCTCCTTTCCGAGCCAAAATCCCAGGCGATGGTGAATTATGAACGTGCCACACCATGAAG 
CTCTTGTGGCAGGTAACTGTGCACCACCACACCTGGAATGCCATCCTGCTCCCGTTCGTCTA 
CCTCACGGCGCAAGTGTGGATTCTGTGTGCAGCCATCGCTGCTGCCGCCTCAGCCGGGCCCC 
AGAACTGCCCCTCCGTTTGCTCGTGCAGTAACCAGTTCAGCAAGGTGGTGTGCACGCGCCGG 
fj GGCCTCTCCGAGGTCCCGCAGGGTATTCCCTCGAACACCCGGTACCTCAACCTCATGGAGAA 
CAACATCCAGATGATCCAGGCCGACACCTTCCGCCACCTCCACCACCTGGAGGTCCTGCAGT 
g TGGGCAGGAACTCCATCCGGCAGATTGAGGTGGGGGCCTTCAACGGCCTGGCCAGCCTCAAC 
| ACCCTGGAGCTGTTCGACAACTGGCTGACAGTCATCCCTAGCGGGGCCTTTGAATACCTGTC 
' n CAAGCTGCGGGAGCTCTGGCTTCGCAACAACCCCATCGAAAGCATCCCCTCTTACGCCTTCA 
ACCGGGTGCCCTCCCTCATGCGCCTGGACTTGGGGGAGCTCAAGAAGCTGGAGTATATCTCT 
GAGGGAGCTTTTGAGGGGCTGTTCAACCTCAAGTATCTGAACTTGGGCATGTGCAACATTAA 
AGACATGCCCAATCTCACCCCCCTGGTGGGGCTGGAGGAGCTGGAGATGTCAGGGAACCACT 
TCCCTGAGATCAGGCCTGGCTCCTTCCATGGCCTGAGCTCCCTCAAGAAGCTCTGGGTCATG 
AACTCACAGGTCAGCCTGATTGAGCGGAATGCTTTTGACGGGCTGGCTTCACTTGTGGAACT 
CAACTTGGCCCACAATAACCTCTCTTCTTTGCCCCATGACCTCTTTACCCCGCTGAGGTACC 
TGGTGGAGTTGCATCTACACCACAACCCTTGGAACTGTGATTGTGACATTCTGTGGCTAGCC 
TGGTGGCTTCGAGAGTATATACCCACCAATTCCACCTGCTGTGGCCGCTGTCATGCTCCCAT 
GCACATGCGAGGCCGCTACCTCGTGGAGGTGGACCAGGCCTCCTTCCAGTGCTCTGCCCCCT 
TCATCATGGACGCACCTCGAGACCTCAACATTTCTGAGGGTCGGATGGCAGAACTTAAGTGT 
'M CGGACTCCCCCTATGTCCTCCGTGAAGTGGTTGCTGCCCAATGGGACAGTGCTCAGCCACGC 
CTCCCGCCACCCAAGGATCTCTGTCCTCAACGACGGCACCTTGAACTTTTCCCACGTGCTGC 
TTTCAGACACTGGGGTGTACACATGCATGGTGACCAATGTTGCAGGCAACTCCAACGCCTCG 
GCCTACCTCAATGTGAGCACGGCTGAGCTTAACACCTCCAACTACAGCTTCTTCACCACAGT 
AACAGTGGAGACCACGGAGATCTCGCCTGAGGACACAACGCGAAAGTACAAGCCTGTTCCTA 
CCACGTCCACTGGTTACCAGCCGGCATATACCACCTCTACCACGGTGCTCATTCAGACTACC 
CGTGTGCCCAAGCAGGTGGCAGTACCCGCGACAGACAGCACTGACAAGATGCAGACCAGCCT 
GGATGAAGTCATGAAGACCACCAAGATCATCATTGGCTGCTTTGTGGCAGTGACTCTGCTAG 
CTGCCGCCATGTTGATTGTCTTCTATAAACTTCGTAAGCGGCACCAGCAGCGGAGTACAGTC 
ACAGCCGCCCGGACTGTTGAGATAATCCAGGTGGACGAAGACATCCCAGCAGCAACATCCGC 
AGCAGCAACAGCAGCTCCGTCCGGTGTATCAGGTGAGGGGGCAGTAGTGCTGCCCACAATTC 
ATGACCATATTAACTACAACACCTACAAACCAGCACATGGGGCCCACTGGACAGAAAACAGC 
CTGGGGAACTCTCTGCACCCCACAGTCACCACTATCTCTGAACCTTATATAATTCAGACCCA 
TACCAAGGACAAGGTACAGGAAACTCAAATAT^ACTCCCCTCCCCCAAAAAACTTATAAAAT 
GCAATAGAATGCACACAAAGACAGCAACTTTTGTACAGAGTGGGGAGAGACTTTTTCTTGTA 
^ TATGCTTATATATTAAGTCTATGGGCTGGTTAAAAAAAACAGATTATATTAAAATTTAAAGA 
C AAAAAGT CAAAACA 
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FIGURE 157 

MKLLWQVTVHHHTWNAILLPFVYLTAQVWILCAAIAAAASAGPQNCPSVCSCSNQFSKWCT 
RRGLS E VPQG I P SNTR YLNLMENN IQM I QADT FRHLHHLE VLQLGRNS I RQ I E VGAFNGL AS 
LNTLELFDNWLTVIPSGAFEYLSKLRELWLRNNPIESIPSYAFNRVPSLMRIiDLGELKKLEY 
ISEGAFEGLFNLKYLNLGMCNIKDMPNLTPLVGLEELEMSGNHFPEIRPGSFHGLSSLKKLW 
VMNSQVSLIERNAFDGI^SLVELNLAHNNLSSLPHDLFTPLRYLVELHLHHNPVraCDCDILW 
LtAWWLREY I PTNSTCCGRCHAPMHMRGRYLVE VDQAS FQCSAPF IMDAPRDLNI SEGRMAEL 
KCRTPPMSSVKWLLPNGTVLSHASRHPRISVLNDGTLNFSHVLLSDTGVYTCMVTNVAGNSN 
ASAYLNVSTAELNTSNYSFFTTVTVETTEISPEDTTRKYKPVPTTSTGYQPAYTTSTTVLIQ 
TTRVPKQVAVPATDTTDKMQTSLDEVMKTTKI I IGCFVAVTLLAAAMLI VFYKLRKRHQQRS 
TVTAARTVE 1 1 Q VDED I PAATSAAATAAPSGVS GEGAWL PT I HDH INYNT YKP AHGAHWTE 
NSLGNSLHPTVTTISEPYIIQTHTRDKVQETQI 
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FIGURE 158 

CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTT 
TTCTTCAGCTCCTTCTCATCTCGT'CCTTGCCAAGAGAGTACACAGTCATTAATGAAGCCTGC 
CCTGGAGCAGAGTGGAATATCATGTGTCGGGAGTGCTGTGAATATGATCAGATTGAGTGCGT 
CTGCCCCGGAAAGAGGGAAGTCGTGGGTTATACCATCCCTTGCTGCAGGAATGAGGAGAATG 
1? AGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACTGCAAGAGCTGCCGA 
AATGGCTCATGGGGGGGTACCTTGGATGACTTCTATGTGAAGGGGTTCTACTGTGCAGAGTG 
f§ CCGAGCAGGCTGGTACGGAGGAGACTGCATGCGATGTGGCCAGGTTCTGCGAGCCCCAAAGG 
H GTCAGATTTTGTTGGAAAGCTATCCCCTAAATGCTCACTGTGAATGGACCATTCATGCTAAA 
CCTGGGTTTGTCATCCAACTAAGATTTGTCATGTTGAGTCTGGAGTTTGACTACATGTGCCA 
GTATGACTATGTTGAGGTTCGTGATGGAGACAACCGCGATGGCCAGATCATCAAGCGTGTCT 
^ GTGGCAACGAGCGGCCAGCTCCTATCCAG^GCATAGGATCCTCACTCCACGTCCTCTTCCAC 
TC CG AT GG CT CCAAGAATTTTG AC GGTTTCCATGCC AT TTATGAGGAGATCACAGC AT GCTC 
CTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTG 
CCTGCTTGGCAGGCTATACTGGGCAGCGCTGTGAAAATCTCCTTGAAGAAAGAAACTGCTCA 
GACCCTGGGGGCCCAGTCAATGGGTACCAGAAAATAACAGGGGGCCCTGGGCTTATCAACGG 
ACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGTTCTTAGTG 
GCAATGAGAAAAGAACTTGCCAGCAGAATGGAGAGTGGTCAGGGAAACAGCCCATCTGCATA 
AAAGCCTGCCGAGAACCAAAGATTTCAGACCTGGTGAGAAGGAGAGTTCTTCCGATGCAGGT 
^ TCAGTCAAGGGAGACACCATTACACCAGCTATACTCAGCGGCCTTCAGCAAGCAGAAACTGC 
|j AGAGTGCCCCTACCAAGAAGCCAGCCCTTCCCTTTGGAGATCTGCCCATGGGATACCAACAT 
m CTGCATACCCAGCTCCAGTATGAGTGCATCTCACCCTTCTACCGCCGCCTGGGCAGCAGCAG 
GAGGACATGTCTGAGGACTGGGAAGTGGAGTGGGCGGGCACCATCCTGCATCCCTATCTGCG 
GGAAAATTGAGAAC AT CACT GCTC CAAAG AC C C AAG GG TTG C G CTGG C CGTGGCAGGC AG CC 
ATCTACAGGAGGACCAGCGGGGTGCATGACGGCAGCCTACACAAGGGAGCGTGGTTCCTAGT 
CTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACC 
TGGGGAAGGTCACCATGATCAAGACAGCAGACCTGAAAGTTGTTTTGGGGAAATTCTACCGG 
GATGATGACCGGGATGAGAAGACCATCCAGAGCCTACAGATTTCTGCTATCATTCTGCATCC 
CAACTATGACCCCATCCTGCTTGATGCTGACATCGCCATCCTGAAGCTCCTAGACAAGGCCC 
GT AT C AG C AC C CGAGT C C AG CC C AT CTG CCT CG C TGCCAGTCGGGATCTC AG CACT TCCTTC 
CAGGAGTCCCACATCACTGTGGCTGGCTGGAATGTCCTGGCAGACGTGAGGAGCCCTGGCTT 
CAAGAACGACACACTGCGCTCTGGGGTGGTCAGTGTGGTGGACTCGCTGCTGTGTGAGGAGC 
AGCATGAGGACCATGGCATCCCAGTGAGTGTCACTGATAACATGTTCTGTGCCAGCTGGGAA 
CCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCC 
GGGACGAGCATCTCCTGAGCCACGCTGGCATCTGATGGGACTGGTCAGCTGGAGCTATGATA 
& AAACATGCAGC^ACAGGCTCTCCACTGCCTTCACCAAGGTGCTGCCTTTTAAAGACTGGATT 
GAAAGAAATATGAAATGAACCATGCTCATGCACTCCTTGAGAAGTGTTTCTGTATATCCGTC 
TGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGG 
CTGTGCCAGGGCTTCTGACTTCAGGGACAAAACTCAGTGAAGGGTGAGTAGACCTCCATTGC 
TGGTAGGCTGATGCCGCGTCCACTACTAGGACAGCCAATTGGAAGATGCCAGGGCTTGCAAG 
AAGTAAGTTTCTTCAAAGAAGACCATATACAAAACCTCTCCACTCCACTGACCTGGTGGTCT 
TCCCCAACTTTCAGTTATACGAATGCCATCAGCTTGACCAGGGAAGATCTGGGCTTCATGAG 
GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGC 
TGGGATGTGGTGCATGCCTTTGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCC 
ATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCTGAACTACAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 159 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREW 
GYTIPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGD 
CMRCGQVLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRD 
GDNRDGQ 1 1 KRVCGNERPAP IQS I GS SLHVLFHSDGSKNFDGFHAI YEE ITACSSSPCFHDG 
TCVLDKAGSYKCACLjAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTV 
I VSFFCNNS YVLSGNEKRTCQQNGEWSGKQP I C I KACRE PK I SDLVRRRVLPMQVQSRETPLH 

QLYSAAFSKQKLQSAPTKKPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGK 
WSGR^^SCIPICGKIENITAPKTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNE 
RTWVAAHCVTDLGKVTM I KTADLKWLGKFYRDDDRDE KT I QS LQ ISAI I LHPNYDP ILLD 
ADIAILKLLDKARISTRVQPICLAASRDLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSG 
WSWDSLLCEEQHEDHGIPVSVTDNMFCASWEPTAPSDICTAETGGIAAVSFPGRASPEPR 
WHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWIERJNTMK 

4& 
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FIGURE 160 

accaggcattgtatcttcagttgtcatcaagttcgcaatcagattggaaaagctcaacttga 
agctttcttgcctgcagtgaagcagagagatagatattattcacgtaataaaaaacatgggc 
ttcaacctgactttccacctttcctacaaattccgattactgttgctgttgactttgtgcct 

# gacagtggttgggtgggccaccagtaactacttcgtgggtgccattcaagagattcctaaag 
caaaggagttcatggctaatttccataagaccctcattttggggaagggaaaaactctgact 

3$ aatg aagcatccacgaagaaggtagaacttgacaactgt cctt ctgtgtct c cttac ct cag 

aggccagagcaagctcattttcaaaccagatctcactttggaagaggtacaggcagaaaatc 
ccaaagtgtccagaggccggtatcgccctcaggaatgtaaagctttacagagggtcgccatcctc 

t gttccccaccggaacagagagaaacacgtgatgtacctgctggaacatctgcatcccttcct 
gc ag aggc ag c ag ctggatt at gg c atct acg t c at cc ac caggctgaaggtaaaaagttt a 
atcgagccaaactcttgaatgtgggctatctagaagccctcaaggaagaaaattgggactgc 
tttatattccacgatgtggacctggtacccgagaatgactttaacctttacaagtgtgagga 
gcatcccaagcatctggtggttggcaggaacagcactgggtacaggttacgttacagtggat 
attttgggggtgttactgccctaagcagagXgcagtttttcaaggtgaatggattctctaac 

1 aactactggggatggggaggcgaagacgatgacctcagactcagggttgagctccaaagaat 
gaaaatttcccggcccctgcctgaagtgggtaaatatacaatggtcttccacactagagaca 
aaggcaatgaggtgaacgcagaacggatgaagctcttacaccaagtgtcacgagtctggaga 
acagatgggttgagtagttgttcttataaattagtatctgtggaacacaatcctttatatat 
caacatcacagtggatttctggtttggtgcatgaccctggatcttttggtgatgtttggaag 
aactgattctttgtttgcaataattttggcctagagacttcaaatagtagcacacattaaga 
acctgttacagctcattgttgagctgaatttttcctttttgtattttcttagcagagctcct 
ggtgatgtagagtataaaacagttgtaacaagacagctttcttagtcattttgatcatgagg 
gttaaatattgtaatatggatacttgaaggactttatataaaaggatgactcaaaggataaa 
atgaacgctatttgaggactctggttgaaggagatttatttaaatttgaagtaatatattat 
gggataaaaggccacaggaaataagactgctgaatgtctgagagaaccagagttgttctcgt 

^ ccaaggtagaaaggtacgaagatacaatactgttattcatttatcctgtacaatcatctgtg 
aagtggtggtgtcaggtgagaaggcgtccacaaaagaggggagaaaaggcgacgaatcagga 
cacagtgaacttgggaatgaagaggtagcaggagggtggagtgtcggctgcaaaggcagcag 

^ TAGCTGAGCTGGTTGCAGGTGCTGATAGCCTTCAGGGGAGGACCTGCCCAGGTATGCCTTCC 

agtgatgcccaccagagaatacattctctattagtttttaaagagtttttgtaaaatgattt 
tgtacaagtaggatatgaattagcagtttacaagtttacatattaactaataataaatatgt 
ctatcaaatacctctgtagtaaaatgtgaaaaagcaaaa 
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FIGURE 161 

MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYPVGAIQEIPKAKEFMANFHKTLILGKGKT 
LTNEASTKKVELDNCPSVSP YLRGQS KL I FKPDLTLEE VQAENPKVSRGRYRPQECKALQRV 
AILVPHRNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEAbKEEN 
WDCFIFHDVDLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNG 
FSNNYWGWGGEDDDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSR 
VWRTDGLS S CS YKL VS VEHNPL Y INI T VDFWFGA 

Important features : 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites : 

amino acids 4-7, 220-223 and 335-338 

Xylose isomerase proteins : 

amino acids 191-201 



.9963088A2. I 



WO 99/63088 



FIGURE 162 



PCT/US99/12252 



CGTGGGCCGGGGTCGCGCAGCGGGCTGTGGGCGCGCCCGGAGGAGCGACCGCCGCAGTTCTC 
GAGCTCCAGCTGCATTCCCTCCGCGTCCGCCCCACGCTTCTCCCGCTCCGGGCCCCGCAATG 
GCCCAGGCAGTGTGGTCGCGCCTCGGCCGCATCCTCTGGCTTGCCTGCCTCCTGCCCTGGGC 
CCCGGCAGGGGTGGCCGCAGGCCTGTATGAACTCAATCTCACCACCGATAGCCCTGCCACCA 
^ CGGGAGCGGTGGTGACCATCTCGGCCAGCCTGGTGGCCAAGGACAACGGCAGCCTGGCCCTG 
™ CCCGCTGACGCCCACCTCTACCGCTTCCACTGGATCCACACCCCGCTGGTGCTTACTGGCAA 
GATGGAGAAGGGTCTCAGCTCCACCATCCGTGTGGTCGGCCACGTGCCCGGGGAATTCCCGG 
;| TCTCTGTCTGGGTCACTGCCGCTGACTGCTGGATGTGCCAGCCTGTGGCCAGGGGCTTTGTG 
'§ GTCCTCCCCATCACAGAGTTCCTCGTGGGGGACCTTGTTGTCACCCAGAACACTTCCCTACC 
^ CTGGCCCAGCTCCTATCTCACTAAGACCGTCCTGAAAGTCTCCTTCCTCCTCCACGACCCGA 
GCAACTTCCTCAAGACCGCCTTGTTTCTCTACAGCTGGGACTTCGGGGACGGGACCCAGATG 
GTGACTGAAGACTCCGTGGTCTATTATAACTATTCCATCATCGGGACCTTCACCGTGAAGCT 
CAAAGTGGTGGCGGAGTGGGAAGAGGTGGAGCCGGATGCCACGAGGGCTGTGAAGCAGAAGA 
CCGGGGACTTCTCCGCCTCGCTGAAGCTGCAGGAAACCCTTCGAGGCATCCAAGTGTTGGGG 
CCCACCCTAATTCAGACCTTCCAAAAGATGACCGTGACCTTGAACTTCCTGGGGAGCCCTCC 
TCTGACTGTGTGCTGGCGTCTCAAGCCTGAGTGCCT CC CG CTGGAGGAAGGGGAGTGC CACC 
CTGTGTCCGTGGCCAGCACAGCGTACAACCTGACCCACACCTTCAGGGACCCTGGGGACTAC 
TGCTTCAGCATCCGGGCCGAGAATATCATCAGCAAGACACATCAGTACCACAAGATCCAGGT 
GTGGCCCTCCAGAATCCAGCCGGCTGTCTTTGCTTTCCCATGTGCTACACTTATCACTGTGA 
TGTTGGCCTTCATCATGTACATGACCCTGCGGAATGCCACTCAGCAAAAGGACATGGTGGAG 
AACCCGGAGCCACCCTCTGGGGTCAGGTGCTGCTGCCAGATGTGCTGTGGGCCTTTCTTGCT 
fj GGAGACTCCATCTGAGTACCTGGAAATTGTTCGTGAGAACCACGGGCTGCTCCCGCCCCTCT 
'4i ATAAGTCTGTCAAAACTTACACCGTGTGAGCACTCCCCCTCCCCACCCCATCTCAGTGTTAA 
CTGACTGCTGACTTGGAGTTTCCAGCAGGGTGGTGTGCACCACTGACCAGGAGGGGTTCATT 
TGCGTGGGGCTGTTGGCCTGGATCATCCATCCATCTGTACAGTTCAGCCACTGCCACAAGCC 
CCTCCCTCTCTGTCACCCCTGACCCCAGCCATTCACCCATCTGTACAGTCCAGCCACTGACA 
TAAGCCCCACTCGGTTACCACCCCCTTGACCCCCTACCTTTGAAGAGGCTTCGTGCAGGACT 
TTGATGCTTGGGGTGTTCCGTGTTGACTCCTAGGTGGGCCTGGCTGCCCACTGCCCATTCCT 
CTCATATTGGCACATCTGCTGTCCATTGGGGGTTCTCAGTTTCCTCCCGCAGACAGCCCTAC 
CTGTGCCAGAGAGCTAGAAAGAAGGTCATAAAGGGTTAAAAATCCATAACTAAAGGTTGTAC 
ACATAGATGGGGACACTCACAGAGAGAAGTGTGCATGTACACACACCACACACACACACACA 
CACACACACACAGAAATATAAACACATGCGTCACATGGGCATTTCAGATGATCAGCTCTGTA 
TCTGGTTAAGTCGGTTGCTGGGATGCACCCTGCACTAGAGCTGAAAGGAAATTTGACCTCCA 
AGCAGCCCTGACAGGTTCTGGGCCCGGGCCCTCCCTTTGTGCTTTGTCTCTGCAGTTCTTGC 
GCCCTTTATAAGGCCATCCTAGTCCCTGCTGGCTGGCAGGGGCCTGGATGGGGGGCAGGACT 
AATACTGAGTGATTGCAGAGTGCTTTATAAATATCACCTTATTTTATCGAAACCCATCTGTG 
AAACTTTCACTGAGGAAAAGGCCTTGCAGCGGTAGAAGAGGTTGAGTCAAGGCCGGGCGCGG 
TGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACGAGATCAGGA 
m GATCGAGACCACCCTGGCTAACACGGTGAAACCCCGTCTCTACTAAAAAAATACAAAAAGTT 
AGCCGGGCGTGGTGGTGGGTGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATG 
GTGCGAACCCGGGAGGCGGAGCTTGCAGTGAGCCCAGATGGCGCCACTGCACTCCAGCCTGA 
M GTGACAGAGCGAGACTCTGTCTCCA 
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FIGURE 163 

MAQAVWSRLGR I LWLACLLPWAPAGVAAGL YELNLTTD S P ATTG AWT IS AS LV AKDNGS LA 
LPAJ)AHLYRFHWIHTPLVLTGKMEKGLSSTIRWGHVPGEFPVSVWTAADCWMCQPVARGF 
WLPITEFLVGDLWTQNTSLPWPSSVLTKTVLKVSFLLHDPSNFLKTALFLYSWDFGDGTQ 
MVTEDSWYYNYSIIGTFTVKLKWAEWEEVEPDATRAVKQKTGDFSASLKLQETLRGIQVL 
GPTLIQTFQKMTVTLNFLGSPPLTVCWRLKPECLPLEEGECHPVSVASTAYNLTHTFRDPGD 
YCFS I RAEN IIS KTHQYHKI QVWP SR I Q PAVFAF PC AT L I TVMLAF IMYMTLRNATQQKDMV 
ENPEPPSGVRCCCQMCCGPFLLETPSEYLEIVRENHGLLPPLYKSVKTYTV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domain: 

amino acids 339-362 

N-glycosylation sites. 

amino acids 34-37, 58-61, 142-145, 197-200, 300-303 and 364-367 
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FIGURE 164 

gctcaagacccagcagtgggacagccagacagacggcacg atg gcactgagctcccagatct 
gggccgcttgcctcctgctcctcctcctcctcgccagcctgaccagtggctctgttttccca 
caacagacgggacaacttgcagagctgcaaccccaggacagagctggagccagggccagctg 
gatgcccatgttccagaggcgaaggaggcgagacacccacttccccatctgcattttctgct 
& gcggctgctgtcatcgatcaaagtgtgggatgtgctgcaagacgtagaacctacctgccctg 
4 cccccgtcccctcccttccttatttattcctgctgccccagaacataggtcttggaataaaa 
tggctggttcttttgttttccaaaaaaaaaaa^^ 
aaaa: ^aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 



I 
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FIGURE 165 

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTH 
FPICIFCCGCCHRSKCGMCCKT 
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FIGURE 166 

CTGTCAGGAAGGACCATCTGAAGGCTGCAATTTGTTCTTAGGGAGGCAGGTGCTGGCCTGGC 
CTGGATCTTCCACCATGTTCCTGTTGCTGCCTTTTGATAGCCTGATTGTCAACCTTCTGGGC 
ATCTCCCTGACTGTCCTCTTCACCCTCCTTCTCGTTTTCATCATAGTGCCAGCCATTTTTGG 
AGTCTCCTTTGGTATCCGCAAACTCTACATGAAAAGTCTGTTAAAAATCTTTGCGTGGGCTA 
CCTTGAGAATGGAGCGAGGAGCCAAGGAGAAGAACCACCAGCTTTACAAGCCCTACACCAAC 
GGAATCATTGCAAAGGATCCCACTTCACTAGAAGAAGAGATCAAAGAGATTCGTCGAAGTGG 
TAGTAGTAAGGCTCTGGACAACACTCCAGAGTTCGAGCTCTCTGACATTTTCTACTTTTGCC 
GGAAAGGAATGGAGACCATTATGGATGATGAGGTGACAAAGAGATTCTCAGCAGAAGAACTG 
GAGTCCTGGAACCTGCTGAGCAGAACCAATTATAACTTCCAGTACATCAGCCTTCGGCTCAC 
GGTCCTGTGGGGGTTAGGAGTGCTGATTCGGTACTGCTTTCTGCTGCCGCTCAGGATAGCAC 
TGGCTTTCACAGGGATTAGCCTTCTGGTGGTGGGCACAACTGTGGTGGGATACTTGCCAAAT 
GGGAGGTTTAAGGAATTCATGAGTAAACATGTTCACTTAATGTGTTACCGGATCTGCGTGCG 
AG CG CTGA CAGC CATCAT CACCTACC ATGACAGGGAAAACAGACCAAGAAATGGTGG C AT CT 
GTGTGGCCAATCATACCTCACCGATCGATGTGATCATCTTGGCCAGCGATGGCTATTATGCC 
ATGGTGGGTCAAGTGCACGGGGGACTCATGGGTGTGATTCAGAGAGCCATGGTGAAGGCCTG 
CCCACACGTCTGGTTTGAGCGCrCGGAAGTGAAGGATCGCCACCTGGTGGCTAAGAGACTGA 
CTGAACATGTGCAAGATAAAAGCAAGCTGCCTATCCTCATCTTCCCAGAAGGAACCTGCATC 
AATAAT AC ATCGGTG ATG ATGTTCAAAAAGGG AAGT TTTG AAATTGGAGC CA C AGTTT AC C C 
TGTTGCTATCAAGTATGACCCTCAATTTGGCGATGCCTTCTGGAACAGCAGCAAATACGGGA 
TGGTGACGTACCTGCTGCGAATGATGACCAGCTGGGCCATTGTCTGCAGCGTGTGGTACCTG 
CCTCCCATGACTAGAGAGGCAGATGAAGATGCTGTCCAGTTTGCGAATAGGGTGAAATCTGC 
CATTGCCAGGCAGGGAGGACTTGTGGACCTGCTGTGGGATGGGGGCCTGAAGAGGGAGAAGG 
TGAAGGACACGTTCAAGGAGGAGCAGCAGAAGCTGTACAGCAAGATGATCGTGGGGAACCAC 
AAGGACAGGAGCCGCTCCTGAGCCTGCCTCCAGCTGGCTGGGGCCACCGTGCGGGGTGCCAA 
CGGGCTCAGAGCTGGAGTTGCCGCCGCCGCCCCCACTGCTGTGTCCTTTCCAGACTCCAGGG 
CTCCCCGGGCTGCTCTGGATCCCAGGACTCCGGCTTTCGCCGAGCCGCAGCGGGATCCCTGT 
GCACCCGGCGCAGCCTACCCTTGGTGGTCTAAACGGATGCTGCTGGGTGTTGCGACCCAGGA 
CGAGATGCCTTGTTTCTTTTACAATAAGTCGTTGGAGGAATGCCATTAAAGTGAACTCCCCA 
CCTTTGCACGCTGTGCGGGCTGAGTGGTTGGGGAGATGTGGCCATGGTCTTGTGCTAGAGAT 
GGCGGTACAAGAGTCTGTTATGCAAGCCCGTGTGCCAGGGATGTGCTGGGGGCGGCCACCCG 
CTCTCCAGGAAAGGCACAGCTGAGGCACTGTGGCTGGCTTCGGCCTCAACATCGCCCCCAGC 
CTTGGAGCTCTGCAGACATGATAGGAAGGAAACTGTCATCTGCAGGGGCTTTCAGCAAAATG 
AAGGGTTAGATTTTTATGCTGCTGCTGATGGGGTTACTAAAGGGAGGGGAAGAGGCCAGGTG 
GGCCGCTGACTGGGCCATGGGGAGAACGTGTGTTCGTACTCCAGGCTAACCCTGAACTCCCC 
ATGTGATGCGCGCTTTGTTGAATGTGTGTCTCGGTTTCCCCATCTGTAATATGAGTCGGGGG 
GAATGGTGGTGATTCCTACCTCACAGGGCTGTTGTGGGGATTAAAGTGCTGCGGGTGAGTGA 
AGGACACATCACGTTCAGTGTTTCAAGTACAGGCCCACAAAACGGGGCACGGCAGGCCTGAG 
CTCAGAGCTGCTGCACTGGGCTTTGGATTTGTTCTTGTGAGTAAATAAAACTGGCTGGTGAATGA 
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FIGURE 167 

MFLLLPFDSL I VNLLGI SLTVLFTLLLVF 1 1 VPA I FGVSFGI RKLYMKSLLK I F AWATLRME 
RGAKEKNHQLYKPYTNGIIAKDPTSLEEEIKEIRRSGSSKALDNTPEFELSDIFYFCRKGME 
TIMDDEVTKRFSAEELESWNLLSRTNYNFQYISLRLTVLWGLGVLIRYCFLiLPLRIALAFTG 
I SLLWGTT WG YLPNGRFKEFMS KHVHLMCYRI CVRALTAI I TYHDRENRPRNGG I CVANH 
TS P I DV 1 1 LASDG Y YAIWGQVHGGLMGVI QRAIWKACPHVWFERS E VKDRHLVAKRLTEHVQ 
DKSKLP I L I FPEGTC INNTSVMMFKKGSFE IGATVY PVAI KYDPQFGDAFWNSSKYGMVTYL 
LRMMTSWAIVCSVWYLPPMTREADEDAVQFANRVKSAIARQGGLVDLLWDGGLKREKVKDTF 
KEEQQKLYSKMIVGNHKDRSRS 
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FIGURE 168 

GCCCCTCGAAACCAGGACTCCAGCACCTCTGGTCCCGCCCTCACCCGGACCCCTGGCCCTCA 
CGTCTCCTCCAGGGATGGCGCTGGCGGCTTTGATGATCGCCCTCGGCAGCCTCGGCCTCCAC 
ACCTGGCAGGCCCAGGCTGTTCCCA^^ATCCTGCCCCTGGGCCTGGCTCCAGACACCTTTGA 
CGATACCTATGTGGGTTGTGCAGAGGAGATGGAGGAGAAGGCAGCCCCCCTGCTAAAGGAGG 
AAATGGCCCACCATGCCCTGCTGCGGGAATCCTGGGAGGCAGCCCAGGAGACCTGGGAGGAC 
AAGCGTCGAGGGCTTACCTTGCCCCCTGGCTTCAAA iCCCAGAATGGAATAGCCATTATGGT 
CTACACCAACTCATCGAACACCTTGTACTGGGAGTTGAATCAGGCCGTGCGGACGGGCGGAG 
GCTCCCGGGAGCTCTACATGAGGCAC^TTCCCTTCAAGGCCCTGCATTTCTACCTGATCCGG 
GCCCTGCAGCTGCTGCGAGGCAGTGGGGGCTGCAGCAGGGGACCTGGGGAGGTGGTGTTCCG 
AGGTGTGGGCAGCCTTCGCTTTGAACCCAAGAGGCTGGGGGACTCTGTCCGCTTGGGCCAGT 
TTGCCTCCAGCTCCCTGGATAAGGCAGTGGCCCACAGATTTGGGGAGAAGAGGCGGGGCTGT 
GTGTCTGCGCCAGGGGTGCAGCTAGGGTCACAATCTGAGGGGGCCTCCTCTCTGCCCCCCTG 
GAAGACTCTGCTCTTGGCCCCTGGAGAGTTCCAGCTCTCAGGGGTTGGGCCCTGAAAGTCCA 
ACATCTGCCACTTAGGAGCCCTGGGAACGGGTGACCTTCATATGACGAAGAGGCACCTCCAG 
CAGCCTTGAGAAGCAAGAACATGGTTCCGGACCCAGCCCTAGCAGCCTTCTCCCCAACCAGG 
ATGTTGGCCTGGGGAGGCCACAGCAGGGCTGAGGGAACTCTGCTATGTGATGGGGACTTCCT 
GGGACAAGCAAGGAAAGTACTGAGGCAGCCACTTGATTGAACGGTGTTGCAATGTGGAGACA 
TGGAGTTTTATTGAGGTAGCTACGTGATTAAATGGTATTGCAGTGTGGA 
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FIGURE 169 

MALAALMIALGSLGLHTWQAQAVPTILPLG 

ALLRESWEAAQETWEDKRRGLTLPPGFKAQNGIAIMVYTNSSNTLYWELNQAVRTGGGSREL 
YMRHFPFKALHFYLIRALQLLRGSGGCSRGPGEWFRGVGSLRFEPKRLGDSVRLGQFASSS 
LDKAVAHRFGEKRRGCVSAPGVQLGSQSEGASSLPPWKTLLLAPGEFQLSGVGP 
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FIGURE 170 

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCA 
C C CT CAT C TAT AT C CTTTGG C AGCT CAC AGGGT C AG C AG C CT CT GG AC C C GT GAAAGAG C TG 
GTCGGTTCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTC 
TATTGTCTGGACCTTCAACACAACCCCTCTTGTCACCATACAGCCAGAAGGGGGCACTATCA 
I TAGTGACCCAAAATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAG 
| CTCAGCAAACTGAAGAAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACT 
CCAGCAGCCCTCCACCCAGGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAG 
TCACCATGGGTCTGCAGAGCAATAAGAATGGCACCTGTGTGACCAATCTGACATGCTGCATG 
GAACATGGGGAAGAGGATGTGATTTATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTC 
CCATAATGGGTCCATCCTCCCCATCTCCTGGAGATGGGGAGAAAGTGATATGACCTTCATCT 
GCGTTGCCAGGAACCCTGTCAGCAGAAACTTCTCAAGCCCCATCCTTGCCAGGAAGCTCTGT 
GAAGGTGCTGCTGATGACCCAGATTCCTCCATGGTCCTCCTGTGTCTCCTGTTGGTGCCCCT 
CCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTGGTTTCTGAAGAGAGAGAGACAAGAAG 

i 

$ AGTACATTGAAGAGAAGAAGAGAGTGGACATTTGTCGGGAAACTCCTAACATATGCCCCCAT 
TCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAATAGAACAATCCTAAAGGAAGA 
TCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAAAGATGGAAAATCCCCACTCAC 
TGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTATCTAGACAGCAGTG 
CACTCCCCTAAGTCTCTGCTCA 
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FIGURE 171 

MAGSPTCLTL I Y I LWQLTGS AASGPVKELVGS VGGAVT F PLKS KVKQVDS I VWTFNTTPL VT 
IQPEGGTI IVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHV 
YEHLSKPKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRW 
GESDMTFICVARNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLW 
FLKRERQEEYI EEKKRVD I CRETPNI CPHSGENTE YDT I PHTNRT I LKEDPANTVYSTVE I P 
KKMENPHSLLTMPDTPRLFAYENVI 
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FIGURE 172 

CTGGTTCCCCAACATGCCTCACCCTCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCC 
TCTGGACCCGTGAAAGAGCTGGTCGGTTCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTC 
CAAAGTAAAGCAAGTTGACTCTATTGTCTGGACCTTCAACACAACCCCTCTTGTCACCATAC 
AGCCAGAAGGGGGCACTATCATAGTGACCCAAAATCGTAATAGGGAGAGAGTAGACTTCCCA 
GATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAGAAGAATGACTCAGGGATCTACTATGT 
GGGGAT AT ACAG CTCATCACTC CAGCAG CCCTCC AC CC AGGAGTACGTGCTGCATGTCTACG 
AGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCAATAAGAATGGCACCTGTGTG 
ACCAATCTGACATGCTGCATGGAACATGGGGAAGAGGATGTGATTTATACCTGGAAGGCCCT 
GGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTCCTGGAGATGGGGAG 
AAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGT CAGCAG AAACTTCTCAAGCCCC 
ATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATGGTCCTCCT 
GTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTGGTTTC 
TGAAGAGAGAGAGACAAGAAGAGTACATTGAAGAGAAGAAGAGAGTGGACATTTGTCGGGAA 
ACTCCTAACATATGCCCCCATTCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAA 
TAGAACAATCCTAAAGGAAGATCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAA 
AGATGGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAG 
AATGTTATCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCAAAAAAAAAAAAAAAAAAA 
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FIGURE 173 

GAAAGACGTGGTCCTGACAGACAGACAATCCTATTCCCTACCAAAATTGAAGATGCTGCTGCT 
GCTGTGTTTGGGACTGACCCTAGTCTGTGTCCATGCAGAAGAAGCTAGTTCTACGGGAAGGA 
ACTTTAATGTAGAAAAGATTAATGGGGAATGGCATACTATTATCCTGGCCTCTGACAAAAGA 
GAAAAGATAGAAGAACATGGCAACTTTAGACTTTTTCTGGAGCAAATCCATGTCTTGGAGAA 
TTCCTTAGTTCTTAAAGTCCATACTGTAAGAGATGAAGAGTGCTCCGAATTATCTATGGTTG 

i CTGACAAAACAGAAAAGGCTGGTGAATATTCTGTGACGTATGATGGATTCAATACATTTACT 
ATACCTAAGACAGACTATGATAACTTTCTTATGGCTCACCTCATTAACGAAAAGGATGGGGA 

■ ■ AACCTTCCAGCTGATGGGGCTCTATGGCCGAGAACCAGATTTGAGTTCAGACATCAAGGAAA 

c GGTTTGCACAACTATGTGAGGAGCATGGAATCCTTAGAGAAAATATCATTGACCTATCCAAT 
GCCAATCGCTGCCTCCAGGCCCGAGAATGAAGAATGGCCTGAGCCTCCAGTGTTGAGTGGAC 
ACTTCTCACCAGGACTCCAC-ATCATCCCTTCCTATCCATACAGCATCCCCAGTATAAATTC 
TGTGATCTGCATTCCATCCTGTCTCACTGAGAAGTCCAATTCCAGTCTATCAACATGTTACC 
TAGGATACCTCATCAAGAATCAAAGACTTCTTTAAATTTCTCTTTGATACACCCTTGACAAT 

aI TTTTCATGAAATTATTCCTCTTCCTGTTCAATAAATGATTACCCTTGCACTTAA 
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FIGURE 174 

MKMLLLLCLGLTLVCVHAEEASSTGRNFNVEKINGEWHTIILASDKREKIEEHGNFRLFLEQ 
IHVLENSLVLKVHTVRDEECSELSMVADKTEKAGEYSVTYDGFNTFTIPKTDYDNFLMAHLI 
NEKDGETFQLMGLYGREPDLSSD I KERFAQLCEEHG I LREN 1 1 DLSNANRCLQARE 
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FIGURE 175 



GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAA 
TGGATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTG 
TCAGCTTAGTTGAGGAAGACCAAT^TCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTC 
CCAGGAATTATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAG 
^ AAAAAGAGCGTGCTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCA 
% CAGTCATTGGTGCTCTGTATTGCATGCTGATATCC/iTCCAGGCTCTCTTAAAAGGTCCTCTC 
ATGT GT AATT CT CC AAGC AAC AGT AATG CC AATTGTG AATTTT CATTGAAAAACAT CAGTGA 
CATT CATCCAGAAT CCTTCAACTTGr 1 AGTGGTTTTT CAATGACTCTTGTGCACCTCCTACTG 
GTTTCAATAAACCCACCAGTAACGACACCATGGCGAGTGGCTGGAGAGCATCTAGTTTCCAC 
TTCGATTCTGAAGAAAACAAACATAGGCTTATCCACTTCTCAGTATTTTTAGGTCTATTGCT 
TGTTGGAATTCTGGAGGTCCTGTTTGGGCTCAGTCAGATAGTCATCGGTTTCCTTGGCTGTC 
TGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTGTGTAGTTTAATGGGAATAAAATGTAAGTA 

^ TCAGTAGTTTGAAAAAAAAAAA 

% 
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FIGURE 176 

MTCCEGWTSCNGFSLLVLLLLGVVLNAIPLIVSLVEEDQFSQNPISCFEWWFPGI IGAGLMA 
IPATTMSLTARKRACCNNRTGMFLSSFFSVITVIGALYCMLISIQALLKGPLMCNSPSNSNA 
NCEFSLKNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRL 
IHFSVFLGLLLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 
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FIGURE 177 

GTCGAATCCAAATCACTCATTGTGAAAGCTGAGCTCACAGCCGAATAAGCCACCATGAGGCT 
GTCAGTGTGTCTCCTGATGGTCTCGCTGGCCCTTTGCTGCTACCAGGCCCATGCTCTTGTCT 
GCCCAGCTGTTGCTTCTGAGATCACAGTCTTCTTATTCTTAAGTGACGCTGCGGTAAACCTC 
CAAGTTGCCAAACTTAATCCACCTCCAGAAGCTCTTGCAGCCAAGTTGGAAGTGAAGCACTG 
CACCGATCAGATATCTTTTAAGAAACGACTCTCATTGAAAAAGTCCTGGTGGAAATAGTGAA 
AAAATGTGGTGTGTGACATGTAAAAATGCTCAACCTGGTTTCCAAAGTCTTTCAACGACACC 
CTGATCTTCACTAAAAATTGTAAAGGTTTCAACACGTTGCTTTAATAAATCACTTGCCCTGC 
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FIGURE 178 



MRLSVCLLMVSLALCCYQAHALVCPAVASEITVFLFLSDAAVNLQVAKLNPPPEALAAKLEV 
KHCTDQ I S FKKRLSLKKSWWK 
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FIGURE 179 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGG 
TGAAGGAGCAGTGAGGAACCTGCAGAGTCACACAGTTGCTGACCAATTGAGCTGTGAGCCTG 
GAGCAGATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTC 
GAACTGTGACATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGG 
AAGCCAATGACCCATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAG 
CTGAGCGGACTGATCTGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGG 
CAAATG CAAATACAAGAGCAGCCAGAAG CAGCAC AGTC CTGTACCTGAGAAGGCCATCCCAC 
TCATCACTCCAGGCTCTGCCACTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGA 
AGCCTAACACTGGCCCCCAGCACCTCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTC 
TCTCCAAGGGCAGGCTGTTAGGCCCCTTTCTGATCAGGAGGCTTCTTTATGAATTAAACTCG 
CCCCACCACCCCCTCA 
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FIGURE 180 

MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGL I CGGLLAI AGIAAVLSGKCK 
YKS SQKQHS P VPEKAI PL I TPGSATTC 
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FIGURE 181 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCC 
TGGCTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTC 
CTGGCTACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCC 
AGTGTTTCCCACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCT 
ACAGAGGAGGGCTTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGT 
ATGGCTGGGTCCCATCATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCA 
CCAATGCCTCAGCTGCCATTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGG 
CTGGGAGAAGGGATACTGCTGAGTGG^^GTGACAAGTGGAGCCGCCACCGTCGGATGCTGAC 
GCCGGCCTTCCATTTCAACATCCTGAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACA 
TCATGCTTGACAAGTGGCAGCACCTGGCCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAG 
CACATCAGCCTCATGACCTTGGACAGTCTACAGAAATGCATCTTCAGCTTTGACAGCCATTG 
TCAGGAGAGGCCCAGTGAATATATTGCCACCATCTTGGAGCTCAGTGCCCTTGTAGAGAAAA 
GAAGCCAGCATATCCTCCAGCACATGGACTTTCTGTATTACCTGTCCCATGACGGGCGGCGC 
TTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGACGCTGTCATCCGGGAGCGGCGTCG 
CACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAAAGCCAAGTCCAAGACTTTGG 
ATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGGCATTGTCAGATGAGGAT 
ATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACGGCCAGTGGCCTCTC 
CTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCGACAGGAGGTGC 
AAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCCAGCTGCCC 
TTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATCTCCCG 
ATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTGCC 
TCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGAC 
CCCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTC 
CGCAGGG<wCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGG 
CGTTGATGCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAA 
TTGATCATGCGGGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCA 
GTGACTTTCTGACCCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE 182 

MSLLSLPWLGLRPVAMSPWLLLLLWGSWLLARILAW 

HLGLITPTEEGLKDSTQMSATYSQGFTVWLGPI IPFIVLCHPDTIRSITNASAAIAPKDNLF 
I RFLKPWLGEG I LLSGGDKWSRHRRMLTPAFHFN ILKSYITI FNKSAN I MLDKWQHLASEGS 
SRLDMFEHISLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYY 
LSHDGRRFHRACRLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDG 
KALSDEDIRAEADTFMFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEW 
DDLAQL P FLTMC VKES LRLHPPAP F I SRCCTQD I VL P DGRV I P KG I TCL I D 1 1 GVHHNPT VW 
PDPEVYDPFRFDPENSKGRSPLAFIPFSAGPRNCIGQAFAMAEMKVVLALMLLHFRFLPDHT 
E PRRKLEL I MRAEGGLVtf LRVE PLNVGLQ 
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FIGURE 183 

CAACAGAAGCCAAGAAGGAAGCCGTCTATCTTGTGGCGATCATGTATAAGCTGGCCTCCTGC 
TGTTTGCTTTTCACAGGATTCTTAAATCCTCTCTTATCTCTTCCTCTCCTTGACTCCAGGGA 
AATATCCTTTCAACTCTCAGCACCTCATGAAGACGCGCGCTTAACTCCGGAGGAGCTAGAAA 
GAGCTTCCCTTCTACAGATATTGCCAGAGATGCTGGGTGCAGAAAGAGGGGATATTCTCAGG 
AAAGCAGACTCAAGTACCAACATTTTTAACCCAAGAGGAAATTTGAGAAAGTTTCAGGATTT 
CTCTGGACAAGATCCTAACATTTTACTGAGTCATCTTTTGGCCAGAATCTGGAAACCATACA 
AGAAACGTGAGACTCCTGATTGCTTCTGGAAATACTGTGTCTGAAGTGAAATAAGCATCTGT 
TAGTCAGCTCAGAAACACCCATCTTAGAATATGAAAAATAACACAATGCTTGATTTGAAAAC 
AGTGTGGAGAAAAACTAGGCAAACTACACCCTGTTCATTGTTACCTGGAAAATAAATCCTCT 
ATGTTTTGCACAAAAAAAAAAAAAAA 
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FIGURE 184 



MYKLASCCLLFTGFLNPLLSLPLLDSREISFQLSAPHEDARLTPEELERASLLQILPEMLGA 
ERGDILRKADSSTNIFNPRGNLRKFQDFSGQDPNILLSHLLARIWKPYKKRETPDCFWKYCV 
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FIGURE 1 85 



GAACATTTTTAGTTCCCAAGGAATGTACATCAGCCCCACGGAAGCTAGGCCACCTCTGGGAT 
GGGGTTGCTGGTTTAAAACAAACGCCAGTCATCCTATATAAGGACCTGACAGCCACCAGGCA 
CCACCTCCGCCAGGAACTGCAGGCCCACCTGTCTGCAACCCAGCTGAGGCCATGCCCTCCCC 
AGGGACCGTCTGCAGCCTCCTGCTCCTCGGCATGCTCTGGCTGGACTTGGCCATGGCAGGCT 
CCAGCTTCCTGAGCCCTGAACACCAGAGAGTCCAGCAGAGAAAGGAGTCGAAGAAGCCACCA 
GCCAAGCTGCAGCCCCGAGCTCTAGCAGGCTGGCTCCGCCCGGAAGATGGAGGTCAAGCAGA 
AGGGGCAGAGGATGAACTGGAAGTCCGGTTCAACGCCCCCTTTGATGTTGGAATCAAGCTGT 
CAGGGGTTCAGTACCAGCAGCACAGCCAGGCCCTGGGGAAGTTTCTTCAGGACATCCTCTGG 
GAAGAGGCCAAAGAGGCCCCAGCCGACAAGTGATCGCCCACAAGCCTTACTCACCTCTCTCT 
AAGTTTAGAAGCGCTCATCTGGCTTTTCGCTTGCTTCTGCAGCAACTCCCACGACTGTTGTA 
CAAGCTCAGGAGGCGAATAAATGTTCAAACTGTA 
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FIGURE 186 

MPSPGTVCSLLLLGMLWLDLAMAGSSFLSPEHQRVQQRKESKKPPAKLQPRALAGWLRPEDG 
GQAEGAEDELEVRFNAPFDVG I KLSGVQYQQHSQALGKFLQD I LWEEAKEAPADKO 
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FIGURE 187 

CGGCCACAGCTGGCATGCTCTGCCTGATCGCCATCCTGCTGTATGTCCTCGTCCAGTACCTC 
GTGAACCCCGGGGTGCTCCGCACGGACCCCAGATGTCAAGAATATGAACACGTGGCTGCTGT 
TCCTCCCCCTGTTCCCGGTGCAGGTGCAGACGCTGATAGTCGTGATCATCGGGATGCTCGTG 
CTCCTGCTGGACTTTCTTGGCTTGGTGCACCTGGGCCAGCTGCTCATCTTCCACATCTACCT 
GAGTATGTCCCCCACCCTAAGCCCCCGATCCCCCCAAGGCTGGGTGGTCAGAGCTGCTCATC 
TTACACCTCTACTTGAGTATGTCCCTAACCCTGAGCCCCCCACGCCTGGGGCCAGAGTCTTT 
GTCCCCCGTGTGCGCATGTGTTCAGGGTCAGCCTCTCCCAGAAGTGAGATCATGGACAAAAA 
GG ~ TAAATCACAGGAAGAAATTAAATCCATGAGGACCCAGCAGGCCCAGCAAGAAGCTGAAC 
TCACGCCGAGACCTGCAGGAGTGGTGCCAGGTGCTTGAAGTAACAAGTTTAAAATGTTCAGA 
GACAATGGAATGGAATCTATTAGGCAAGAACAGGACATTATGAAATAAGGACAGGTGGACTT 
CCAAAAACACAAGTAGAAATTCTAACAATGAAATATATTACAGGCAGGTCACCCACTAACCA 
AACAACTGAAGCGAGAGCTGTGGTCTTGCTTGGTCTCACAGTGGGCACAGCGGTAGGCGGTC 
AGTCATGTTGCTGAACGACGGAGGGTAAACTCCCCAGCCCCAAGAAAACCTGTGTTGGAAGT 
AACAACAACCTCCCTGCTCCTGGCACCAGCCGTTTTGGTCATGGTGGGCCAGCTGCAAAGCG 
TCTTCCATTCTCTGGGCAGTGGTGGCCCCGAGGCTGTGGCCTCTCAGGGGGTTTCTGTGGAC 
ACGGGCAGCAGAGTGTGTCCAGGCCAGCCCCCAAGAATGCCCTGCTCCTGACAGCTTGGCCA 
ACCCCTGGTCAGGGCAGAGGGAGTTGGGTGGGTCAGGCTCTGGGCTCACCTCCATCTCCAGA 
GCATCCCCTGCCTGCAGTTGTGGCAAGAACGCCCAGCTCAGAATGAACACACCCCACCAAGA 
GCCTCCTTGTTCATAACCACAGGTTACCCTACAAACCACTGTCCCCACACAACCCTGGGGAT 
GTTTTAAAACACACACCTCTAACGCATATCTTACAGTCACTGTTGTCTTGCCTGAGGGTTGA 
ATTTTTTTTAATGAAAGTGCAATGAAAATCACTGGATTAAATCCTACGGACACAGAGCTGAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 188 

MNTWLLFLPLFPVQVQTLIWIIGMLVLLLDFLGLVHLGQLLIFHIYLSMSPTLSPRSPQGW 
WRAAHLTPLLEYVPNPEPPTPGARVFVPRVRMCSGSASPRSEIMDKKGKSQEEIKSMRTQQ 
AQQEAELTPRPAGWPGA 



*0 
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FIGURE 189 

GGAGTGCAGATGGCATCCTTCGGTTCTTCCAGACAAGCTGCAAGACGCTGACCATGGCCAAG 
ATGGAGCTCTCGAAGGCCTTCTCTGGCCAGCGGACACTCCTATCTGCCATCCTCAGCATGCT 
ATCACTCAGCTTCTCCACAACATCCCTGCTCAGCAACTACTGGTTTGTGGGCACACAGAAGG 
TGCCCAAGCCCCTGTGCGAGAAAGGTCTGGCAGCCAAGTGCTTTGACATGCCAGTGTCCCTG 
GATGGAGATACCAACACATCCACCCAGGAGGTGGTACAATACAACTGGGAGACTGGGGATGA 
CCGGTTCTCCTTCCGGAGCTTCCGGAGTGGCATGTGGCTATCCTGTGAGGAAACTGTGGAAG 
AACCAGGGGAGAGGTGCCGAAGTTTCATTGAACTTACACCACCAGCCAAGAGAGGTGAGAAA 
GGACTACTGGAATTTGCCACGTTGCAAGGCCCATGTCACCCCACTCTCCGATTTGGAGGGAA. 
GCGGTTGATGGAGAAGGCTTCCCTCCCCTCCCCTCCCTTGGGGCTTTGTGGCAAAAATCCTA 
TGGTTATCCCTGGGAACGCAGATCACCTACATCGGACTTCAATTCATCAGCTTCCTCCTGCT 
ACTAACAGACTTGCTACTCACTGGGAACCCTGCCTGTGGGCTCAAACTGAGCGCCTTTGCTG 
CTGTTTCCTCTGTCCTGTCAGGTCTCCTGGGGATGGTGGCCCACATGATGTATTCACAAGTC 
TTCCAAGCGACTGTCAACTTGGGTCCAGAAGACTGGAGACCACATGTTTGGAATTATGGCTG 
GGCCTTCTACATGGCCTGGCTCTC CTTCAC CTG CTG CATGG CGT CGGCTG TCAC CAGCTT CA 
ACACGTACACCAGGATGGTGCTGGAGTTCAAGTGCAAGCA TAG TAAGAGCTTCAAGGAAAAC 
CCGAACTGCCTACCACATCACCATCAGTGTTTCCCTCGGCGGCTGTCAAGTGCAGCCCCCAC 
CGTGGGTCCTTTGACCAGCTACCACCAGTATCATAATCAGCCCATCCACTCTGTCTCTGAGG 
GAGTCGACTTCTACTCCGAGCTGCGGAACAAGGGATTTCAAAGAGGGGCCAGCCAGGAGCTG 
AAAGAAGCAG^TAGGTCATCTGTAGAGGAAGAGCAGTGTTAGGAGTTAAGCGGGTTTGGGGA 
GTAGGCTTGAGCCCTACCTTACACGTCTGCTGATTATCAACATGTGCTTAAGCCAACATCCG 
TCTCTTGAGCATGGTTTTTAGAGGCTACGAATAAGGCTATGAATAAGGGTTATCTTTAAGTC 
CTAAGGGATTCCTGGGTGCCACTGCTCTCTTTTCCTCTAGAGCTCCATCTTGTTTCACCCAC 
CCCACATCTCACACATCCAGAATTCCCTTCTTTACTGATAGTTTCTGTGCCAGGTTCTGGGC 
TAAAC CATGGAGATAAAAAGAAGAGTAAAATACACTT C CCGACCTTAAGGAT CTGAAA 
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FIGURE 190 

MAKMELSKAFSGQRTLLSAILSMLiSLSPSTTSLLSNYWFVGTQKVPKPLCEKGLAAKCFDMP 
VSLDGDTNTSTQEWQYNWETGDDRFSFRSFRSGMWLSCEETVEEPGERCRSFIELTPPAKR 
GEKGLLEFATLQGPCHPTLRFGGKRLMEKASLPSPPLGLCGKNPMVIPGNADHLHRTSIHQL 
PPATNRLATHWEPCLWAQTERLCCCFLCPVRSPGDGGPHDVFTSLPSDCQLGSRRLETTCLE 
LWLGLLHGLALLHLLHGVGCHHLQHVHQDGAGVQVQA 
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FIGURE 191 

AACTGGAAGGAAAGAAAGAAAGGTCAGCTTTGGCCCAGATGTGGTTACCCCTTGGTCTCCTG 
TCTTTATGTCTTTCTCCTCTTCCTATTCTGTCATCTCCCTCACTTAAGTCTCAGGCCTGTCA 
G C AG CT C CTGTGGAC ATTGC CAT CCi^CTCT GGTAG C CT T C AG AG C AAAC AGG A C AA C CT ATG 
TTATGGATGTTTCCACCAACCAGGGTAGTGGCATGGAGCACCGTAACCATCTGTGCTTCTGT 

v ; GATCTCTATGACAGAGCCACTTCTCCACCTCTGAAATGTTCCCTGCTCTGAAATCTGGCATG 

$ t AGATGGCACAGGTGACCACGCAGAAGCCACCAGAAT CTTGCCTGCCCTATTCCTCCTCCCAA 

GTCTGTTCTCTTATTGTCAACCTCAGCACAACAGGCTGGCGCCAATGGCATTACAGAGAAAG 

./* CAATCTGTGTGGCTAGTGGGCAGATT^CCATGCAAGCCCCAGGAGAAATGGAGGAGCTTTGT 
AGCCACCTCCCTGTCAGCCAGTATTAACATGTCCCCTTCCCCCTGCCCCGCCGTAGATTCAG 
GACATTCGCCCCTGTGTGCCACCAAACCAGGACTTTCCCCTTGGCTTGGCATCCCTGGCTCT 
CTCCTGGTACCCAGCAAGACGTCTGTTCCAGGGCAGTGTAGCATCTTTCAAGCTCCGTTACT 
ATGGCGATGGCCATGATGTTACAATCCCACTTGCCTGAATAATCAAGTGGGAAGGGGAAGCA 
GAGGGAAATGGGGCCATGTGAATGCAGCTG CT CTGT T CT C C CT AC C CTGAGGAAAAAC CAAA , 

^ GGGAAGCAACAGGAACTTCTGCAACTGGTTTTTATCGGAAAGATCATCCTGCCTGCAGATGC 
TGTTGAAGGGGCACAAGAAATGTAGCTGGAGAAGATTGATGAAAGTGCAGGTGTGTAAGGAA 
ATAGAACAGTCTGCTGGGAGTCAGACCTGGAATTCTGATTCCAAACTCTTTATTACTTTGGG 
AAGTCACTCAGCCTCCCCGTAGCCATCTCCAGGGTGACGGAACCCAGTGTATTACCTGCTGG 
AACCAAGG AAACTAAC AATGTAGGTT ACTAGT GAAT AC C C CAATGGTTTCTCCAATTATGC C 
CATGCCACCAAAACAATAAAACAAAATTCTCTAACACTGAAA 
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FIGURE 192 

MWLPLGLLSLCLSPLPILSSPSLKSQACQQLLWTLPSPLVAFRANRTTYVMDVSTNQGSGME 
HRNHLCFCDL YDRATS PPLKCS LL 
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FIGURE 193 

GTAGCGCGTCTTGGGTCTCCCGGCTGCCGCTGCTGCCGCCGCCGCCTCGGGTCGTGGAGCCA 
GGAGCGACGTCACCGCCATGGCAGGCATCAAAGCTTTGATTAGTTTGTCCTTTGGAGGAGCA 
ATCGGACTGATGTTTTTGATGCTTGGATGTGCCCTTCCAATATACAACAAATACTGGCCCCT 
CTTTGTTCTATTTTTTTACATCCTTTCACCTATTCCATACTGCATAGCAAGAAGATTAGTGG 
ATGATACAGATGCTATGAGTAACGCTTGTAAGGAACTTGCCATCTTTCTTACAACGGGCATT 
GTCGTGTCAGCTTTTGGACTCCCTATTGTATTTGCCAGAGCACATCTGATTGAGTGGGGAGC 
TTGTGCACTTGTTCTCACAGGAAACACAGTCATCTTTGCAACTATACTAGGCTTTTTCTTGG 
TCTTTGGAAGCAATGACGACTTCAGCTGGCAGCAGTGGTGAAAAGAAATTACTGAACTATTG 
TCAAATGGACTTCCTGTCATTTGTTGGCCATTCACGCACACAGGAGATGGGGCAGTTAATGC 
TGAATGGTATAGCAAGCCTCTTGGGGGTATTTTAGGTGCTCCCTTCTCACTTTTATTGTAAG 
CATACTATTTTCACAGAGACTTGCTGAAGGATTAAAAGGATTTTCTCTTTTGGAAAAGCTTG 
ACTGATTTCACACTTATCTATAGTATGCTTTTTGTGGTGTCCTGCTGAATTTAAATATTTAT 
GTGTTTTTCCTGTTAGGTTGATTTTTTTTGGAATCAATATGCAATGTTAAACACTTTTTTAA 
TGTAATCATTTGCATTGGTTAGGAATTCAGAATTCCGCCGGCTCTATTACTGGTCAAGTACA 
T C TT TT CT CTTAAAAT T ATTT AG C CT C C ATT ATT A C AAAAAAT T AT AAAAAT AAGT T T T C AG 
TCAGTCAGGATGACATCACTCCCAATGTTATGCAGACATACAGACGGTTGGCATACGTTATA 
GACTGTATACTCAGTGCAAATATAGCTGCATTTATACCTCAGAGGGGCCAAGTGTTAATGCC 
CATGCCCTCCGTTAAGGGTTGTTGGTTTTACTGGTAGACAGATGTTTTGTGGATTGAAAATT 
ATTTTATGGAATTGCTACAGAGGAGTGCTTTTCTTCTCAATTGTTAGAAGAATTTATGTTAA 
ACTTTAAGGTAAGGGTGTAAAAACATTTTTGAGATAAGGTTTTTATTTATGTTTATTATTGT 
TAGAGTGAGTTGCAATGTGGGAAGAAATGACATTGAAATT CC AG TTTTTGAATC CTGTTT CT 
ATTTATAAGTGAAATTTGTGATCTCCTATCAACCTTTCATGTTTTACCCTGTTAAAATGGAC 
ATACATGGAACCACTACTGATGAGGGACAGTTGTATGTTTGCATCATATATGCCAGAAAACC 
TTCCTCTGCTTCCTCCTTTTGACTTATTTGGTATGTTGTATATATTACATAAAATAACTTTT 
CAAATATAGTTTAATAACACTTAGAAGTGTTTACTTACCTGGAAAATAATTGCTATGCCGTA 
CATTCAGAGTGCCCCCTCCCCTGCAAGGCCTTGCCATGATTAACAAGTAACTTGTTAGTCTT 
ACAGATAATTCATGCATTAACAGTTTAAGATTTAGACCATGGTAATAGTAGTTCTTATTCTC 
TAAGGTTATATCATATGTAATTTAAAAGTATTTTTAAGACAAGTTTCCTG^ATACCTCTGAA 
CTGTTTTGATTTTGAGTTCATCATGATAGATCTGCTGTTTCCTTATAAAAGGCATTTGTTGT 
GTGAGTTAATGCAAAGTAGCCAAGTCCAGCTATATAGCAGCTTCAGAAACATACCTGACCAA 
AAAATTCCCAGTAACCAGGCATGATCAATTTATAGTGGTCGTTTACATCTAATAATTATCAG 
GACTTTTTTCAGGAGTGGGTTATAAAAACATTCAAGTTGGTCTGACAGTATTTTGTTAAGGA 
TATTTGTTTGTATGTTTATTCAGTATACTTACATAAAAATTATTTCGCCATCAGCCAAAACT 
CAGTAATCATGACAGCTGTCTGTTGTTTTATGAAGTTTATTTCTCAAGAAAATGGGAATAAA 
TTTGGGATTTGTTCAGCTTTTTTACTAAAGATGCCTAAAGCCACAGGTTTTATTGCCTAACT 
TAAGCCATGACTTTTAGATATGAGATGACGGGAAGCAGGACGAAATATCGGCGTGTGGCTGG 
AGCCTTCCCACTGGAGGCTGAAAGTGGCTTGTGGTATTATAATGTTCAGATTTCAAGAGGAA 
GGTGCAGGTACACATGAGTTAGAGAGCTGGTGAGACAGTTGGGAACTCTTTGTGCTTGTGAT 
CTACTGGACTTTTTTTTTGCAGGAAGTGCATTCTCTGGTCCTTCCCTATTTTCTGTTCTGGA 
TGTCAGTGCAGTGCACTGCTACTGTTTTATCCACTTGGCCACAGACTTTTTCTAACAGCTGC 
GTATTATTTCTATATACTAATTGCATTGGCAGCATTGTGTCTTTGACCTTGTATACTAGCTT 
GACATAGTGCTGTCTCTGATTTCTAGGCTAGTTACTTGAGATATGAATTTTCCATAGAATAT 
GC ACTGAT AC AACATT AC CATTCTTCTATGGAAAGAAAACTTTTGATGATGAAACAATAAAG 
ATTTTAAATATCTATTTTAAAAAAAAAA 
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MAGI KALI SLSFGGAIGLMFLMLGCALP IYNKYWPLFVLFFY I LSP I PY C I ARRLVDDTDAM 
SNACKELAIFLTTGIWSAFGLPIVFARAHLIEWGACALVLTGNTVIFATILGFFLVFGSND 
DFSWQQW 
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FIGURE 195A 



CCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCC 
CACGCGTCCGCCCACGCGTCCGGTGCAAGCTCGCGCCGCACACTGCCTGGTGGAGGGAAGGA 
GCCCGGGCGCCTCTCGCCGCTCCCCGCGCCGCCGTCCGCACCTCCCCACCGCCCGCCGCCCG 
CCGCCCGCCGCCCGCAAAGCATGAGTGAGCCCGCTCTCTGCAGCTGCCCGGGGCGCGAATGG 
CAGGCTGTTTCCGCGGAGTAAAAGGTGGCGCCGGTCAGTGGTCGTTTCCAATGACGGACATT 
AACCAGACTGTCAGATCCTGGGGAGTCGGGAGCCCCGAGTTTGGAGTTTTTTCCCCCCACAA 
CGTCACAGTCCGAACTGCAGAGGGAAAGGAAGGCGGCAGGAAGGCGAAGCTCGGGCTCCGGC 
ACGTAGTTGGGAAACTTGCGGGTCCTAGAAGTCGCCTCCCCGCCTTGCCGGCCGCCCTTGCA 
GCCCCGAGCCGAGCAGCAAAGTGAGACATTGTGCGCCTGCCAGATCCGCCGGCCGCGGACCG 
GGGCTGCCTCGGAAACACAGAGGGGTCTTCTCTCGCCCTGCATATAATTAGCCTGCACACAA 
AGGGAGCAGCTGAATGGAGGTTGTCACTCTCTGGAAAAGGATTTCTGACCGAGCGCTTCCAA 
TGGACATTCTCCAGTCTCTCTGGAAAGATTCTCGCTAATGGATTTCCTGCTGCTCGGTCTCT 
GTCTATACTGGCTGCTGAGGAGGCCCTCGGGGGTGGTCTTGTGTCTGCTGGGGGCCTGCTTT 
CAGATGCTGCCCGCCGCCCCCAGCGGGTGCCCGCAGCTGTGCCGGTGCGAGGGGCGGCTGCT 
GTACTGCGAGGCGCTCAACCTCACCGAGGCGCCCCACAACCTGTCCGGCCTGCTGGGCTTGT 
CCCTGCGCTACAACAGCCTCTCGGAGCTGCGCGCCGGCCAGTTCACGGGGTTAATGCAGCTC 
ACGTGGCTCTATCTGGATCACAATCACATCTGCTCCGTGCAGGGGGACGCCTTTCAGAAACT 
GCGCCGAGTTAAGGAACTCACGCTGAGTTCCAACCAGATCACCCAACTGCCCAACACCACCT 
TCCGGCCCATGCCCAACCTGCGCAGCGTGGACCTCTCGTACAACAAGCTGCAGGCGCTCGCG 
CCCGAGCTCTTCCACGGGCTGCGGAAGCTCACCACGCTGCATATGCGGGCCAACGCCATCCA 
GTTTGTGCCCGTGCGCATCTTCCAGGACTGCCGCAGCCTCAAGTTTCTCGACATCGGATACA 
ATCAGCTCAAGAGTCTGGCGCGCAACTCTTTCGCCGGCTTGTTTAAGCTCACCGAGCTGCAC 
CTCGAGCACAACGACTTGGTCAAGGTGAACTTCGCCCACTTCCCGCGCCTCATCTCCCTGCA 
CTGGCTCTGCCTGCGGAGGAACAAGGTGGCCATTGTGGTCAGCTCGCTGGACTGGGTTTGGA 
ACCTGGAGAAAATGGACTTGTCGGGCAACGAGATCGAGTACATGGAGCCCCATGTGTTCGAG 
ACCGTGCCGCACCTGCAGTCCCTGCAGCTGGACTCCAACCGCCTCACCTACATCGAGCCCCG 
GATCCTCAACTCTTGGAAGTCCCTGACAAGCATCACCCTGGCCGGGAACCTGTGGGATTGCG 
GGCGCAACGTGTGTGCCCTAGCCTCGTGGCTCAGCAACTTCCAGGGGCGCTACGATGGCAAC 
TTGCAGTGCGCCAGCCCGGAGTACGCACAGGGCGAGGACGTCCTGGACGCCGTGTACGCCTT 
CCACCTGTGCGAGGATGGGGCCGAGCCCACCAGCGGCCACCTGCTCTCGGCCGTCACCAACC 
GCAGTGATGTGGGGCCCCCTGCCAGCTCGGCCACCACGCTCGCGGACGGCGGGGAGGGGCAG 
CACGACGGCACATTCGAGCCTGCCACCGTGGCTCTTCCAGGCGGCGAGCACGCCGAGAACGC 
CGTGCAGATCCACAAGGTGGTCACGGGCACCATGGCCCTCATCTTCTCCTTGCTCATCGTGG 
TCCTGGTGCTCTACGTGTCCTGGAAGTGTTTCCCAGCCAGCCTCAGGCAGCTCAGACAGTGC 
TTTGTCACGCAGCGCAGGAAGCAAAAGCAGAAACAGACCATGCATCAGATGGCTGCCATGTC 
TGCCCAGGAATACTACGTTGATTACAAACCGAACCACATTGAGGGAGCCCTGGTGATCATCA 
ACGAGTAT GG CT CGTGTACC TGC C AC CAG C AG C C CG CGAGGG AATG CG AGG TGTGATTGT C C 
CAGTGGCTCTCAACCCATGCGCTACCAAATACGCCTGGGCAGCCGGGACGGGCCGGCGGGCA 
CCAGGCTGGGGTCTCCTTGTCTGTGCTCTGATATGCTCCTTGACTGAAACTTTAAGGGGATC 
TCTCCCAGAGACTTGACATTTTAGCTTTATTGTGTCTTAAAAACAAAAGCGAATTAAAACAC 
AACAAAAAACCCCACCCCACAACCTTCAGGACAGTCTATCTTAAATTTCATATGAGAACTCC 
TTCCTCCCTTTGAAGATCTGTCCATATTCAGGAATCTGAGAGTGTAAAAAAGGTGGCCATAA 
GACAGAGAGAGAATAATCGTGCTTTCTTTTATGCTACTCCTCCCACCCTGCCCATGATTAAA 
CATCATGTATGTAGAAGATCTTAAGTCCATACGCATTTCATGAAGAACCATTGGAAAGAGGA 
ATCTGCAATCTGGGAGCTTAAGAGCAAATGATGACCATAGAAAGCTATGTTCTTACTTTGTG 
TGTGTGTCTGTATGTTTCTGCGTTGTGTGTCTTTGTAGGCAAGCAAACGTTGTCTACACAAA 
CGGGAATTTAGCTCACATCATTTCATGCCCCTGTGCCTCTAGCTCTGGAGATTGGTGGGGGG 
AGGTGGGGGGAAACGGCAGGAATAAGGGAAAGTGGTAGTTTTAACTAAGGTTTTGTAACACT 
TGAAATCTTTTCTTTCTCAAATTAATTATCTTTAAGCTTCAAGAAACTTGCTCTGACCCCTC 
TAAGCAAACTACTAAGCATTTAAAAGAGAATCTAATTTTTAAAGGTGTAGCACCTTTTTTTT 
TATTCTTCCCACAGAGGGTGCTAATCTCATTATGCTGTGCTATCTGAAAAGAACTTAAGGCC 
ACAATTCACGTCTCGTCCTGGGCATTGTGATGGATTGACCCTCCATTTGCAGTACCTTCCCA 
GCTGATTAAAGTTCAGCAGTGGTATTGAGGTTTTTCGAATATTTATATAGAAAAAAAGTCTT 
TTCACATGACAAATGACACTCTCACACCAGTCTTAGCCCTAGTAGTTTTTTAGGTTGGACCA 
GAGGAAGCAGGTTAAATGAGACCTGTCCTCTGCTGCACTCAGAAAAAATAGGCAGTCCCTGA 
TGCTCAGATCTTAGCCTTGATATTAATAGTTGAGACCACCTACCCACAATGCAGCCTATACT 
CCCAAGACTACAAAGTTACCATCGCAAAGGAAAGGTTATTCCAGTAAAAGGAAATAGTTTTC 
TCAACCATTTAAAAATATTCTTCTGAACTCATCAAAGTAGAAGAGCCCCCAACCTTTTCTCT 
CTGCCTTCAAGAAGGCAGACATTTGGTATGATTTAGCATCAACAACACATTTATGAGTATAT 
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FIGURE 195B 

GTAAGTAATCAGAGGGGCAAATGCCACTTGTTATTCCTCCCAAGTTTTCCAAGCAAGTACAC 
ACAGATCTCTGGTAGGATTAGGGGCCACTTGTGTTTCCGGCTTATTTTAGTCGACTTGTCAG 
CAAGTTTGATGCCTAGTCTATCTGACATGGCCCAGTAGAACAGGGCATTGATGGATCACATG 
AGATGGTAGAAGGAACATCATCACATACCCCTCTCACAGAGAAAATTATCAAAGAACCAGAA 
ATTAT AT CTGTT TTGG AG CAAGAG x G'l'CATAATGTTTCAGGGTAGTCAAAATAAACATAAAT 
TATCTCCTCTAGATGAGTGGCGATGTTGGCTGATTTGGGTCTGCCATTGACAGAATGTCAAA 
TAAAAAGGAATTAGCTAGAATATGACCATTAAATGTGCTTCTGAAATATATTTTGAGATAGG 
TTTAGAATGTCA 
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FIGURE 196 

MDFLLLGLCLYWLLRRPSGWLCLLGACFQMLPAAPSGCPQLCRCEGRLLYCEALNLTEAPH 
NLSGLLGLSLRYNSLSELRAGQFTGLMQLTWLYLDHNHICSVQGDAPQKLRRVKELTLSSNQ 
ITQLPNTTFRPMPNLRSVDLSYNKLQALAPDLFHGLRKLTTLHMRANAIQFVPVRIFQDCRS 
LKFLDIGYNQLKSLARNSFAGLFKLTELHLEHNDLVKVNFAHFPRLISLHSLCLRRNKVAIV 
VSSLDWVWNLEKMDLSGNEIEYMEPHVFETVPHLQSLQLDSNRLTYIEPRILNSWKSLTSIT 
LAGNLWDCGRNVCALASWLSNFQGRYDGNLQCASPEYAQGEDVLDAVYAFHLCEDGAEPTSG 
HLLSAVTNRSDLGPPASSATTLADGGEGQHDGTFEPATVALPGGEHAENAVQIHKWTGTMA 
LIFSFL I WL VL YVSWKCFPASLRQLRQC . VTQRRKQ KQKQTMHQMAAMS AQEYYVD YKPNH 
I E GAL VI INEYGSCTCHQQPARECEV 
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FIGURE 197 

GTGCAAGGAGCCGAGGCGAGATGGGCGTCCTGGGCCGGGTCCTGCTGTGGCTGCAGCTCTGC 
GCACTGAGCCAGGCGGTCTCCAAACTCTGGGTCCCCAACACGGACTTCGACGTCGCAGCCAA 
CTGGAGCCAGAACCGGACCCCGTGCGCCGGCGGCGCCGTTGAGTTCCCGGCGGACAAGATGG 
TGTCAGTCCTGGTGCAAGAAGGTCACGCCGTCTCAGACATGCTCCTGCCGCTGGATGGGGAA 
i CTCGTCCTGGCTTCAGGAGCCGGATTCGGCGTCTCAGACGTGGGCTCGCACCTGGACTGTGG 
^ CGCGGGCGAACCTGCCGTCTTCCGCGACTCTGACCGCTTCTCCTGGCATGACCCGCACCTGT 
GGCGCTCTGGGGACGAGGCACCTGGCCTCTTCTTCGTGGACGCCGAGCGCGTGCCCTGCCGC 
- CACGACGACGTCTTCTTTCCGCCTAGTGCCTCCTTCCGCGTGGGGCTCGGCCCTGGCGCTAG 
..; CCCCGTGCGTGTCGGCAGCATCTCGGCTCTGGGCCGGACGTTCACGCGCGACGAGGACCTGG 
CTGTTTTCCTGGCGTCCCGCGCGGGCCGCCTACGCTTCCACGGGCCGGGCGCGCTGAGCGTG 
GGCCCCGAGGACTGCGCGGACCCGTCGGGCTGCGTCTGCGGCAACGCGGAGGCGCAGCCGTG 
GATCTGCGCGGCCCTGCTCCAGCCCCT 

?£4 
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FIGURE 198 

MGVLGRVLLWLQLCALTQAVSKLWVPNTDFDVAANWSQNRTPCAGGAVEFPADKMVSVLVQE 
GHAVSDMLLPLDGELVLASGAGFGVSDVGSHLDCGAGEPAVFRDSDRFSWHDPHLWRSGDEA 
PGLFFVDAERVPCRHDDVFFPPSASFRVGLGPGASPVRVRSISALGRTFTRDEDLAVFLASR 
AGRLRFHGPGALSVGPEDCADPSGCVCGNAEAQPWICAALLQP 
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atcgcatcaattgggagtaccatcttcctcatgggaccagtgaaacagctgaagcgaatgtt 

tgagcctactcgtttgattgcaactatcatggtgctgttgtgttttgcacttaccctgtgtt 

ctgccttttggtggcataacaagggacttgcacttatcttctgcattttgcagtctttggca 

ttgacgtggtacagcctttccttcataccatttgcaagggatgctgtgaagaagtgttttgc 

cgtgtgtcttgcataattcatggccagttttatgaagctttggaaggcactatggacagaag 

ctggtggacagttttgtaactatcttcgaaacctctgtcttacagacatgtgccttttatct 

tgcagcaatgtgttgcttgtgattcgaacatttgagggttacttttggaagcaacaatacat 

tctcgaacctgaatgtcagtagcacaggatgagaagtgggttc:tgtatcttgtggagtggaa 

tcttcctcatgtacctgtttcctctctggatgttgtcccactgaattcccatgaatacaaac 

ctattcagcaacagcaaaaaaaaaaaaaaaaaaaaaaaa^^ 

aaaaaaaaaaaaaaaaaaaaaaaa 
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FIGURE 200 

MGPVKQLKRMFEPTRLIATIMVLLCFALTLCSAFWWHNKGLALIFCILQSLALTWYSLSFrP 
FARDAVKKCFAVCLA 
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FIGURE 201 

TTGAGCGCAGGTGAGCTCCTGCGCGTTCCGGGGGCGTTCCTCCAGTCACCCTCCCGCCGTTA 
CCCGCGGCGCGCCCGAGGGAGTCTCCTCCAGACCCTCCCTCCCGTTGCTCCAAACTAATACG 
GACTGAACGGATCGCTGCGAGGGTGGGAGAGAAAATTAGGGGGAGAAAGGACAGAGAGAGCA 
ACTACCATCCATAGCCAGATAGATTATCTTACACTGAACTGATCAAGTACTTTGAAAATGAC 
^ TTCGAAATTTATCTTGGTGTCCTTCATACTTGCTGCACTGAGTCTTTCAACCACCTTTTCTC 
TCCAACTAGACCAGCAAAAGGTTCTACTAGTTTCTTTTGATGGATTCCGTTGGGATTACTTA 
4| TATAAAGTTCCAACGCCCCATTTTCATTATATTATG7VAATATGGTGTTCACGTGAAGCAAGT 
$ TACTAATGTTTTTATTACAAAAACCTACCCTAACCATTATACTTTGGTAACTGGCCTCTTTG 
CAGAGAATCATGGGATTGTTGCAAATGATATGTTTGATCCTATTCGGAACAAATCTTTCTCC 
TTGGATCACATGAATATTTATGATTCCAAGTTTTGGGAAGAAGCGACACCAATATGGATCAC 
AAACCAGAGGGCAGGACATACTAGTGGTG _AGCCATGTGGCCCGGAACAGATGTAAAAATAC 
ATAAGCGCTTTCCTACTCATTACATGCCTTACAATGAGTCAGTTTCATTTGAAGATAGAGTT 
GCCAAAATTGTTGAATGGTTTACGTCAAAAGAGCCCATAAATCTTGGTCTTCTCTATTGGGA 
AGACCCTGATGACATGGGCCACCATTTGGGACCTGACAGTCCGCTCATGGGGCCTGTCATTT 
CAGATATTGACAAGAAGTTAGGATATCTCATACAAATGCTGAAAAAGGCAAAGTTGTGGAAC 
ACTCTGAACCTAATCATCACAAGTGATCATGGAATGACGCAGTGCTCTGAGGAAAGGTTAAT 
AGAACTTGAC CAGTACCTGGATAAAG ACCACTAT AC CC TGATTGAT CAAT CT CCAGTAGCAG 
C C AT CTTG CCAAAAGAAGGTAAATTTGATG AAG T CT AT GAAG C ACT AACT C ACGCT CAT C CT 
f ,.. AATCTTACTGTTTACAAAAAAG AAG ACGTT CC AG AAAG GT GG C ATT AC AAATACAACAGT CG 

M AATTCAACCAATCATAGCAGTGGCTGATGAAGGGTGGCACATTTTACAGAATAAGTCAGATG 
ACTTTCTGTT AGG C AACC ACGGTT AC GAT AAT G CGT TAG C AG AT AT G CAT CC AAT ATTTTT A 
GCCCATGGTCCTGCCTTCAGAAAGAATTTCTCAAAAGAAGCCATGAACTCCACAGATTTGTA 
CCCACTACTATGCCACCTCCTCAATATCACTGCCATGCCACACAATGGATCATTCTGGAATG 
TCCAGGATCTGCTCAATTCAGCAATGCCAAGGGTGGTCCCTTATACACAGAGTACTATACTC 
CT C C CT GGT AGT GTT AAA C CAG C AG AAT AT G A C C AAG AGG G GT CAT AC C C TT ATTT CAT AGG 
GGTCTCTCTTGGCAGCATTATAGTGATTGTATTTTTTGTAATTTTCATTAAGCATTTAATTC 
AC AGT C AAATAC CTG CCTTAC AAG AT ATG C ATG CTG AAATAG C TC AAC CATTATTACAAGC C 
TAATGTTACTTTGAAGTGGATTTGCATATTGAAGTGGAGATTCCATAATTATGTCAGTGTTT 
AAAGGTTTCAAATTCTGGGAAACCAGTTCCAAACATCTGCAGAAACCATTAAGCAGTTACAT 
ATTTAGGTATACACACACACACACACACACACATACACACACACGGACCAAAATACTTACAC 
CTGCAAAGGAATAAAGATGTGAGAGTATGTCTCCATTGTTCACTGTAGCATAGGGATAGATA 
AGATCCTGCTTTATTTGGACTTGGCGCAGATAATGTATATATTTAGCAACTTTGCACTATGT 
AAAGTACCTTATATATTGCACTTTAAATTTCTCTCCTGATGGGTACTTTAATTTGAAATGCA 
CTTTATGGACAGTTATGTCTTATAACTTGATTGAAAATGACAACTTTTTGCACCCATGTCAC 
AGAATACTTGTTACGCATTGTTCAAACTGAAGG AAATT TCT AAT AAT C C CGAATAATGAACA 
TAGAAATCTATCTCCATAAATTGAGAGAAGAAGAAGGTGATAAGTGTTGAAAATTAAATGTG 
ATAACCTTTGAACCTTGAATTTTGGAGATGTATTCCCAACAGCAGAATGCAACTGTGGGCAT 
I TTCTTGTCTTATTTCTTTCCAGAGAACGTGGTTTTCATTTATTTTTCCCTCAAAAGAGAGTC 
AAATACTGACAGATTCGTTCTAAATATATTGTTTCTGTCATAAAATTATTGTGATTTCCTGA 
TGAGTCATATTACTGTGATTTTCATAATAATGAAGACACCATGAATATACTTTTCTTCTATA 
TAGTTCAGCAATGGCCTGAATAGAAGCAACCAGGCACCATCTCAGCAATGTTTTCTCTTGTT 
TGTAATTATTTGCTCCTTTGAAAATTAAATCACTATTAATTACATTAAAAATCAAATTGGAT 
AAAAAAAAAAAAAAAAAAA 
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FIGURE 202 

MTSKFILVSFIIiAALSLSTTFSLQLDQQKVLLVSFDGFRWDYLYKVPTPHFHYIMKYGVHVK 
QVTNVF I TKTYPNHYTLVTGLFAENHG I VANDMFD P I RNKS F SLDHMN I YDS KFWE EATP IV? 
ITNQRAGHTSGAAMWPGTDVKIHKRFPTHYMPYNESVSFEDRVAKIVEWFTSKEPINLGLLY 
WEDPDDMGHHLGPDSPLMGPVISDIDKKLGYLIQMLKKAKLWNTLNLIITSDHGMTQCSEER 
L1ELDQYLDKDHYTLIDQSPVAAILPKEGKFDEVYEALTHAHPNLTVYKKEDVPERWHYKYN 
SR I QP 1 I AVADEGWHI LQNKSDDFLLGNHGYDNALADMHP I FLAHGPAFRKNFSKEAMNSTD 
LYPLLCHLLNITAMPHNGSFWNVQDLLNSAMPRVVPYTQSTILLPGSVKPAEYDQEGSYPYF 
I GV S LGS 1 1 V I VF F V I F I KHL I HS Q I PALQDMHAE I AQ P L LQ A 
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FIGURE 203 

GGATTTTTGTGATCCGCGATTCGCTCCCACGGGCGGGACCTTTGTAACTGCGGGAGGCCCAG 
GACAGGCCCACCCTGCGGGGCGGGAGGCAGCCGGGGTGAGGGAGGTGAAGAAACCAAGACGC 
AGAGAGGC C AAG CCC C TTGC CTTGGG T C AC AC AG C C AAAGGAGG CAGAGCCAGAACT CACAA 

m 

CCAGATCCAGAGGCAACAGGGACATGGCCACCTGGGACGAAAAGGCAGTCACCCGCAGGGCC 
H AAGGTGGCTCCCGCTGAGAGGATGAGCAAGTTCTTAAGGCACTTCACGGTCGTGGGAGACGA 
S CTACCATGCCTGGAACATCAACTACAAGAAATGGGAGAATGAAGAGGAGGAGGAGGAGGAGG 
AGCAGCCACCACCCACACCAGTCTCAGGCGAGGAAGGCAGAGCTGCAGCCCCTGACGTTGCC 
CCTGCCCCTGGCCCCGCACC.CAGGGCCCCCCTTGACTTCAGGGGCATGTTGAGGAAACTGTT 
CAGCTCCCACAGGTTTCAGGTCATCATCATCTGCTTGGTGGTTCTGGATGCCCTCCTGGTGC 
TTGCTGAGCTCATCCTGGACCTGAAGATCATCCAGCCCGACAAGAATAACTATGCTGCCATG 
GTATTCCACTACATGAGCATCACCATCTTGGTCTTTTTTATGATGGAGATCATCTTTAAATT 
ATTTGTCTTCCGCCTGAGTTCTTTCACCACAAGTTTGAGATCCTGGATGCCCGTCGTGGTGG 
H TGGTCTCATTCATCCTGGACATTGTCCTCCTGTTCCAGGAGCACCAGTTTGAGGCTCTGGGC 
f CTGCTGATTCTGCTCCGGCTGTGGCGGGTGGCCCGGATCATCAATGGGATTATCATCTCAGT 
TAAGACACGTTCAGAACGGCAACTCTTAAGGTTAAAACAGATGAATGTACAATTGGCCGCCA 
AGATTCAACACCTTGAGTTCAGCTGCTCTGAGAAGCCCCTGGACTGATGAGTTTGCTGTATC 
AACCTGTAAGGAGAAGCTCTCTCCGGATGGCTATGGGAATGAAAGAATCCGACTTCTACTCT 
CACACAGCCACCGTGAAAGTCCTGGAGTAAAATGTGCTGTGTACAGAAGAGAGAGAAGGAAG 
CAGGCTGGCATGTTCACTGGGCTGGTGTTACGACAGAGAACCTGACAGTCACTGGCCAGTTA 
T C AC TT C AGATT AC AAAT C ACACAG AGC AT CTGCCTGTTTT C AAT C AC AAG AGAAC AAAA C C 
AAAATCTATAAAGATATTCTGAAAATATGACAGAATTTGACAAATAAAAGCATAAACGTGTA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

'■fa 
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FIGURE 204 

MATWDEKAVTRRAKVAPAERMSKFLRHFTVVGDDYHAWNINYKKWENEEEEEEEEQPPPTPV 
SGEEGRAAAPDVAPAPGPAPRAPLDFRGMLRKLFSSHRFQVI 1 1 CLVVLD ALLVLAEL I LDL 
KIIQPDKNNYAAIWFHYMSIT^^ 

VLLFQEHQFEALGLLILLRLWRVARIINGIIISVKTRSERQLLRLKQMNVQLAAKIQHLEFS 
CSEKPLD 



BNSDOCID: <W0 9963088A2 I > 



WO 99/63088 & / PCT/US99/12252 

-€o# /3fO 
FIGURE 205 

CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACAT 
GCTCTGTCTGTGCCTGTACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTG 
AGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCCATTTTCAAGCTCAGTGTCTTCATCCCCTCC 
CAGGAATTCTCCACCTACCGCCAGTGGAAGCAGAAAATTGTACAAGCTGGAGATAAGGACCT 
TGATGGGCAGCTAGACTTTGAAGAATTTGTCCATTATCTCCAAGATCATGAGAAGAAGCTGA 
GGCTGGTGTTTAAGATTTTGGACAAAAAGAATGATGGACGCATTGACGCGCAGGAGATCATG 
CAGTCCCTGCGGGACTTGGGAGTCAAGATATCTGAACAGCAGGCAGAAAAAATTCTCAAGAG 
CATGGATAAAAACGGCACGATGACCATCGACTGGAACGAGTGGAGAGACTACCACCTCCTCC 
ACCCCGTGGAAAACATCCCCGAGATCATCCTCTACTGGAAGCATTCCACGATCTTTGATGTG 
| GGTGAGAATCTAACGGTCCCGGATGAGTTCACAGTGGAGGAGAGGCAGACGGGGATGTGGTG 
GAGACACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGG 
ACAGGCTCAAGGTGCTCATGCAGGTCCATGCCTCCCGCAGCAACAACATGGGCATCGTTGGT 
GGCTTCACTCAGATGATTCGAGAAGGAGGGGCCAGGTCACTCTGGCGGGGCAATGGCATCAA 
CG T C CT C AAAAT T G C C C C C G AAT C AG CC AT C AAATT CAT GGCC TAT GAG C AG AT C AAG CG C C 
TTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCA 
GGGGCCATCGCCCAGAGCAGCATCTACCCAATGGAGGTCCTGAAGACCCGGATGGCGCTGCG 
GAAGACAGGCCAGTACTCAGGAATGCTGGACTGCGCCAGGAGGATCCTGGCCAGAGAGGGGG 
TGGCCGCCTTCTACAAAGGCTATGTCCCCAACATGCTGGGCATCATCCCCTATGCCGGCATC 
GACCTTGCAGTCTACGAGACGCTCAAGAATGCCTGGCTGCAGCACTATGCAGTGAACAGCGC 
GGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGG 
CCAGCTACCCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCT 
CCGGAGGTGACCATGAGCAGCCTCTTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCT 
GTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCAGCTGTGAGCATCAGCTACGTGG 
S TCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGGTGACGGGGGGAGGGCCGCCCG 
^ GCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATG 
^ TGCCAACACTAAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGT 
GGGGAGAGCTGGCAGGCCCAGGGCTTGTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCC 
AGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGGCTCCGGGCTCACATGTGTAA 
GGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCTTAGT 
TCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGGCGTGC 
ATCTCCCTGTGCCCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAG 
CCCACATCCCACCCCCTCGTCCAATCCCATAATCCATGATGAAAGGTGAGGTCACGTGGCCT 
CCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGCTGTGAAGGAAGAGGAAAG 
GATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATGCT 
TGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCA 
TGGTGCTCTGAGCTGGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTG 
TCCCCACTGTGGCATGAGGGCAGTGGAGCACCATGTTTGAGGGCGAAGGGCAGAGCGTTTGT 
GTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGGACTGTTGGGAAAAGGG 
TTTTGTCCAGAAGGACAAGCCGGACAAATGAGCGACTTCTGTGCTTCCAGAGGAAGACGAGG 
GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACC 
CCAGCAGGGGCGCAGCGGGACCAGCCCCACATTCCACTTGTGTCACTGCTTGGAACCTATTT 
ATTTTGTATTTATTTGAACAGAGTTATGTCCTAACTATTTTTATAGATTTGTTTAATTAATA 
: m GCTTGTCATTTTCAAGTTCATTTTTTATTCATATTTATGTTCATGGTTGATTGTACCTTCCC 
AAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACAT 
CTGT CCAGAG AAATT C CTTTTGGG ACTGGAGG C AG AAAAGCGG CC AG AAGG CAGC AG CC CTG 
GCTCCTTTCCTTTGGCAGGTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGA 
CTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAATAACCTTGAAGGTGGAATCCAGTTATTTC 
CTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAGTGAGGTGCCTCT 
CACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCC 
CAGCCTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCC 
ATTCCACCAGAATGACCTGATGAGGAAATCTTCAATAGGATGCAAAGATCAATGCAAAAATT 
GTTATATATGAACATATAACTGGAGTCGTCAAAAAGCAAATTAAGAAAGAATTGGACGTTAG 
AAGTTGTCATTTAAAGCAGCCTTCTAATAAAGTTGTTTCAAAGCTGAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 206 

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKD 
LDGQLDFEEFVHYLQDHEKKLiRLVFK I LDKKNDGR I DAQE IMQSLRDLGVKI SEQQAEKI LK 
SMDKNGTMT IDWNEWRDYHLLHPVEN I PE 1 1 L YWKHST I FDVGENLTVPDEFTVEERQTGMW 
WRHLVAGGGAGAVS RT CT AP LDRL KVLMQVHAS RSNNMG I VGGFTQMI REGGARSLWRGNG I 
NVLK I APES AI KFMAYEQ I KRLVGSDQETLRI HERLA AGS LAGAI AQSS I YPMEVLKTRMAL 
RKTGQYSGMLDCARRILAREGVAAFYKGYVPNMLGI 1 D YAGIDLAVYETLKKAWLQHYAVNS 
ADPGVFVLLACGTMSSTCGQLASYPLALVRTRMQAQASIEGAPEVTMSSLFKHILRTEGAFG 
LYRGLAPNFMKVI PAVS I S YWYENLKl TLGVQSR 
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FIGURE 207 

GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACQCTGGGAACCTTCCCCAGCCAT 
GGCTTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAG 
CAATTGCACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTC 
GCCTCAGCTGGGAACATTGGGGAGGATGGAATCCTGAGCTGCACTTTTGAACCTGACATCAA 



|j ACTTTCTGATATCGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCA 
AAGAAGGCAAAGATGAGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTT 
GCTGATCAAGTGATAGTTGGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGC 
^ TGGCACCTACAAATGTTATATCATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATA 
AAACTGGAGCCTTCAGCATGCCGGAAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTG 
CGGTGTGAGGCTCCCCGATGGTTCCCCCAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCA 
GGGAGCCAACTTCTCGGAAGTCTCCAATACCAGCTTTGAGCTGAACTCTGAGAATGTGACCA 
TGAAGGTTGTGTCTGTGCTCTACAATGTTACGATCAACAACACATACTCCTGTATGATTGAA 
AATGACATTGCCAAAGCAACAGGGGATATCAAAGTGACAGAATCGGAGATCAAAAGGCGGAG 
TCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGTCTCTTCTTTCTTTGCCATCAGCT 
GGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAATGTGCCTTGGCCACAAAAAAG 
CATGCAAAGTCATTGTTACAACAGGGATCTACAGAACTATTTCACCACCAGATATGACCTAG 
TTTTATATTTCTGGGAGGAAATGAATTCATATCTAGAAGTCTGGAGTGAGCAAACAAGAGCA 
AGAAACAAAAAGAAG CCAAAAGC AG AAGGC T C C AAT AT GAAC AAGAT AAAT C TAT C TT CAAA 
GACATATTAGAAGTTGGGAAAATAATTCATGTGAACTAGACAAGTGTGTTAAGAGTGATAAG 
TAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGTCACCT 
GGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGCTG 
TAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCA 
CAAATTAAGCTGTAZTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGG 
GGCGGCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCT 
TGGCTTCTCTTCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAATTTTAGCATAA 
ACAGAGCAGTCGGGGACACCGATTTTATAAATAAACTGAGCACCTTCTTTTTAAACAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 208 

MASLGQ ILFWS I I S I I I ILAGAIALI IGFGI SGRHS I TVTTVASAGNI GEDG ILSCTFEPD I 
KLSDIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTD 
AGTYKCYIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTWWASQVD 
QGANFSEVSNTS FELNSENVTMKWS VL YNVT INNT YS CM I END I AKATGD I KVTESE I KRR 
SHLQLLNSKASLCVSSFFAISWALLPLSPYLMLK 
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FIGURE 209 

GAATTTGTAGAAGACAGCGGCGTTGCCATGGCGGCGTCTCTGGGGCAGGTGTTGGCTCTGGT 
GCTGGTGGCCGCTCTGTGGGGTGGCACGCAGCCGCTGCTGAAGCGGGCCTCCGCCGGCCTGC 
AGCGGGTTCATGAGCCGACCTGGGCCCAGCAGTTGCTACAGGAGATGAAGACCCTCTTCTTG 
AATACTGAGTACCTGATGCCCTTTCTCCTCAACCAGTGTGGATCCCTTCTCTATTACCTCAC 
CTTGGCATCGACAGATCTGACCCTGGCTGTGCCCATCTGTAACTCTCTGGCTATCATCTTCA 
CACTGATTGTTGGGAAGGCCCTTGGAGAAGATATTGGTGGAAAACGTAAGTTAGACTACTGC 
GAGTGCGGGACGCAGCTCTGTGGATCTCGACATACCTGTGTTAGTTCCTTCCCAGAACCCAT 
CTCCCCAGAGTGGGTGAGGACACGGCCTTTTCCCATCCTGCCCTTTCCTCTGCAGCTGTTTT 
GCTTCCTTGTGGCCATCAGAGTTCCCTTCCCCTGGACAGTCTGGAGAAAGACAGAGGCTGGG 
GTTTGGGATTGAAGACCAGACCCCATCTGAGCCCTTCCTCCAGCCCTGTACCAGCTCCTACT 
GGCATGGCTGAGCTCAGACCCTCCTGATTTCTGCCTATTATCCCAGGAGCAGTTGCTGGCAT 
GGTGCTCACCGTGATAGGAATTTCACTCTGCATCACAAGCTCAGTGAGTAAGACCCAGGGGC 
AACAGTCTACCCTTTGAGTGGGCCGAACCCACTTCCAGCTCTGCTGCCTCCAGGAAGCCCCT 
GGGCCATGAAGTGCTGGCAGTGAGCGGATGGACCTAGCACTTCCCCTCTCTGGCCTTAGCTT 
CCTCCTCTCTTATGGGGATAACAGCTACCTCATGGATCACAATAAGAGAACAAGAGTGAAAG 
AGTTTTGTAACCTTCAAGTGCTGTTCAGCTGCGGGGATTTAGCACAGGAGACTCTACGCTCA 
CCCTCAGCAACCTTTCTGCCCCAGCAGCTCTCTTCCTGCTAACATCTCAGGCTCCCAGCCCA 
GCCACCATTACTGTGGCCTGATCTGGACTATCATGGTGGCAGGTTCCATGGACTGCAGAACT 
CCAGCTGCATGGAAAGGGCCAGCTGCAGACTTTGAGCCAGAAATGCAAACGGGAGGCCTCTG 
GGACTCAGTCAGAGCGCTTTGGCTGAATGAGGGGTGGAACCGAGGGAAGAAGGTGCGTCGGA 
GTGGCAGATGCAGGAAATGAGCTGTCTATTAGCCTTGCCTGCCCCACCCATGAGGTAGGCAG 
AAATCCTCACTGCCAGCCCCTCTTAAACAGGTAGAGAGCTGTGAGCCCCAGCCCCACCTGAC 
TC CAGCAC AC CTGG CG AG T AGT AGCTGTC AAT AAAT CTATGTAAACAGAC AAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
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FIGURE 210 

MAAS LGQVLALVLVAALWGGTQPLLKRAS AGLQR VHE P TWAQ QLLQ EMKTLF LNTE YL»MP FL 
LNQCGSLLYYLTLASTDLTLAVP I CNSLAI IFTL I VGKALGED I GGKRKLDYCECGTQLCGS 
RHTCVSSFPEPISPEWVRTRPFPILPFPLQLFCFLVAIRVPFPWTVWRKTEAGWD 
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XK 



FIGURE 211 



CTTCTGTAGGACAGTCACCAGGCCAGATCCAGAAGCCTCTCTAGGCTCCAGCTTTCTCTGTG 
GAAGATGACAGCAATTATAGCAGGACCCTGCCAGGCTGTCGAAAAGATTCCGCAATAAAACT 
TTGCCAGTGGGAAGTACCTAGTGAAACGGCCTAAGATGCCACTTCTTCTCATGTCCCAGGCT 
TGAGGCCCTGTGGTCCCCATCCTTGGGAGAAGTCAGCTCCAGCACCATGAAGGGCATCCTCG 
TTGCTGGTATCACTGCAGTGCTTGTTGCAGCTGTAGA.VTCTCTGAGCTGCGTGCAGTGTAAT 
p TCATGGGAAAAATCCTGTGTCAACAGCATTGCCTCTG^ATGTCCCTCACATGCCAACACCAG 
CTGTATCAGCTCCTCAGCCAGCTCCTCTCTAGAGACACCAGTCAGATTATACCAGAATATGT 
TCTGCTCAGCGGAGAACTGCAGTGAGGAGACACACATTACAGCCTTCACTGTCCACGTGTCT 
GCTGAAGAACACTTTCATTTTGTAAGCCAGTGCTGCCAAGGAAAGGAATGCAGCAACACCAG 
CGATGCCCTGGACCCTCCCCTGAAGAACGTGTCCAGCAACGCAGAGTGCCCTGCTTGTTATG 
AATCTAATGGAACTTCCTGTCGTGGGAAGCCCTGGAAATGCTATGAAGAAGAACAGTGTGTC 
TTTCTAGTTGCAGAACTTAAGAATGACATTGAGTCTAAGAGTCTCGTGCTGAAAGGCTGTTC 
CAACGTCAGTAACGCCACCTGTCAGTTCCTGTCTGGTGAAAACAAGACTCTTGGAGGAGTCA 
TCTTTCGAAAGTTTGAGTGTGCAAATGTAAACAGCTTAACCCCCACGTCTGCACCAACCACT 
TCCCACAACGTGGGCTCCAAAGCTTCCCTCTACCTCTTGGCCCTTGCCAGCCTCCTTCTTCG 
GGGACTGCTGCCCTGAGGTCCTGGGGCTGCACTTTGCCCAGCACCCCATTTCTGCTTCTCTG 
AGGTCCAGAGCACCCCCTGCGGTGCTGACACCCTCTTTCCCTGCTCTGCCCCGTTTAACTGC 
CCAGTAAGTGGGAGTCACAGGTCTCCAGGCAATGCCGACAGCTGCCTTGTTCTTCATTATTA 
AAGCACTGGTTCATTCACTGCCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 212 

MKGILVAGITAVLVAAVESLSCVQCNSWEKSCVNSIASECPSHANTSCISSSASSSLETPVR 
LYQNMFCSAENCSEETHITAFTVHVSAEEHFHFVSQCCQGKECSNTSDALDPPLKNVSSNAE 
CPACYESNGTSCRGKPWKCYEEEQCVFLVAELKNDIESKSLVLKGCSNVSNATCQFLSGENK 
TLGGVIFRKFECANVNSLTPTSAPTTSHNVGSKASLYLLALASLLLRGLLP 
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FIGURE 213 

GGCCTCGGTTCAAACGACCCGGTGGGTCTACAGCGGAAGGGAGGGAGCGAAGGTAGGAGGCA 
GGGCTTGCCTCACTGGCCACCCTCCCAACCCCAAGAGCCCAGCCCCATGGTCCCCGCCGCCG 
GCGCGCTGCTGTGGGTCCTGCTGCTGAATCTGGGTCCCCGGGCGGCGGGGGCCCAAGGCCTG 
ACCCAGACTCCGACCGAAATGCAGCGGGTCAGTTTACGCTTTGGGGGCCCCATGACCCGCAG 
J CTACCGGAGCACCGCCCGGACTGGTCTTCCCCGGAAGACAAGGATAATCCTAGAGGACGAGA 
^ ATGATGCCATGGCCGACGCCGACCGCCTGGCTGGACCAGCGGCTGCCGAGCTCTTGGCCGCC 
ACGGTGTCCACCGGCTTTAGCCGGTCGTCCGCCATTAACGAGGAGGATGGGTCTTCAGAAGA 
GGGGGTTGTGATTAATGCCGGAAAGGATAGCACCAGCAGAGAGCTTCCCAGTGCGACTCCCA 
ATACAGCGGGGAGTTCCAGCACGAGGTTTATAGCCAATAGTCAGGAGCCTGAAATCAGGCTG 
ACTTCAAGCCTGCCGCGCTCCCCCGGGAGGTCTACTGAGGACCTGCCAGGCTCGCAGGCCAC 
CCTGAGCCAGTGGTCCACACCTGGGTCTACCCCGAGCCGGTGGCCGTCACCCTCACCCACAG 
CCATGCCATCTCCTGAGGATCTGCGGCTGGTGCTGATGCCCTGGGGCCCGTGGCACTGCCAC 
,:| TGCAAGTCGGGCACCATGAGCCGGAGCCGGTCTGGGAAGCTGCA CGGCCTTTCCGGGCGCCT 

% TCGAGTTGGGGCGCTGAGCCAGCTCCGCACGGAGCACAAGCCTTGCACCTATCAACAATGTC 
CCTGCAACCGACTTCGGGAAGAGTGCCCCCTGGACACAAGTCTCTGTACTGACACCAACTGT 
GCCTCTCAGAGCACCACCAGTACCAGGACCACCACTACCCCCTTCCCCACCATCCACCTCAG 
AAGCAGTCCCAGCCTGCCACCCGCCAGCCCCTGCCCAGCCCTGGCTTTTTGGAAACGGGTCA 
GGATTGGCCTGGAGGATATTTGGAATAGCCTCTCTTCAGTGTTCACAGAGATGCAACCAATA 
GACAGAAACCAGAGGTAATGGCCACTTCATCCACATGAGGAGATGTCAGTATCTCAACCTCT 
CTTGCCCTTTCAATCCTAGCACCCACTAGATATTTTTAGTACAGAAAAACAAAACTGGAAAA 
CACAA 
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MVPAAGALLWVLLLNLGPRAAGAQGLTQTPTEMQRVSLRFGGPMTRSYRSTARTGLPRKTRI 
I LEDENDAMADADRLiAGPAAAELtLAATVSTGF SRS S AI ISfEEDGS SEEGW INAGKDSTSREL 
PSATPNTAGSSSTRFIANSQEPEIRLTSSLPRSPGRSTEDLPGSQATLSQWSTPGSTPSRWP 
SPSPTAMPSPEDLRLVLMPWGPWHCHCKSGTMSRSRSGKLHGLSGRLRVGALSQLRTEHKPC 
TYQQCPCNRLREECPLDTSLCTDTNCASQSTTSTRTTTTPFPTIHLRSSPSLPPASPCPALA 
FWKRVR I GLED I WNS LSS VFTEMQP I DRNQR 
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CCCGGGTCGACCCACGCGTCCGGGGAGAAAGGATGGCCGGCCTGGCGGCGCGGTTGGTCCTG 

CTAGCTGGGGCAGCGGCGCTGGCGAGCGGCTCCCAGGGCGACCGTGAGCCGGTGTACCGCGA 

CTGCGTACTGCAGTGCGAAGAGCAGAACTGCTCTGGGGGCGCTCTGAATCACTTCCGCTCCC 

GCCAGCCAATCTACATGAGTCTAGCAGGCTGGACCTGTCGGGACGACTGTAAGTATGAGTGT 

ATGTGGGTCACCGTTGGGCTCTACCTCCAGGAAGGTCACAAAGTGCCTCAGTTCCATGGCAA 

GTGGCCCTTCTCCCGGTTCCTGTTCTTTCAAGAGCCGGCATCGGCCGTGGCCTCGTTTCTCA 

ATGGCCTGGCCAGCCTGGTGATGCTCTGCCGCTACCGCACCTTCGTGCCAGCCTCCTCCCCC 

ATGTACCACACCTGTGTGGCCTTCGCCTGGGTGTCCCTCAATGCATGGTTCTGGTCCACAGT 

CTTCCACACCAGGGACACTGACCTCACAGAGAAAATGGACTACTTCTGTGCCTCCACTGTCA 

TCCTACACTCAATCTACCTGTGCTGCGTCAGGACCGTGGGGCTGCAGCACCCAGCTGTGGTC 

AGTGCCTTCCGGGCTCTCCTGCTGCTCATGCTGACCGTGCACGTCTCCTACCTGAGCCTCAT 

CCGCTTCGACTATGGCTACAACCTGGTGGCCAACGTGGCTATTGGCCTGGTCAACGTGGTGT 

GGTGGCTGGCCTGGTGCCTGTGGAACCAGCGGCGGCTGCCTCACGTGCGCAAGTGCGTGGTG 

GTGGTCTTGCTGCTGCAGGGGCTGTCCCTGCTCGAGCTGCTTGACTTCGCACCGCTCTTCTG 

GGTCCTGGATGGCCATGCCATCTGGCACATCAGCACCATCCCTGTCCACGTCCTCTTTTTCA 

GCTTTCTGGAAGATGACAGCCTGTACCTGCTGAAGGAATCAGAGGACAAGTTCAAGCTGGAC 

TGAAGACCTTGGAGCGAGTCTGCCCCAGTGGGGATCCTGCCCCCGCCCTGCTGGCCTCCCTT 

CTCCCGTCAACCCTTGAGATGATTTTCTCTTTTCAACTTCTTGAACTTGGACATGAAGGATG 

TGGGCCCAGAATCATGTGGCCAGCCCACCCCCTGTTGGCCCTCACCAGCCTTGGAGTCTGTT 

CTAGGGAAGGCCTCCCAGCATCTGGGACTCGAGAGTGGGCAGCCCCTCTACCTCCTGGAGCT 

GAACTGGGGTGGAACTGAGTGTGTTCTTAGCTCTACCGGGAGGACAGCTGCCTGTTTCCTCC 

CCACCAGCCTCCTCCCCACATCCCCAGCTGCCTGGCTGGGTCCTGAAGCCCTCTGTCTACCT 

GGGAGACCAGGGACCACAGGCCTTAGGGATACAGGGGGTCCCCTTCTGTTACCAGCCCCCAC 

CCTCCTCCAGGACACCACTAGGTGGTGGTGGATGCTTGTTCTTTGGCCAGCCAAGGTTCACG 

GCGATTCTCCCCATGGGATCTTGAGGGACCAAGCTGCTGGGATTGGGAAGGAGTTTCACCCT 

GACCGTTGCCCTAGCCAGGTTCCCAGGAGGCCTCACCATACTCCCTTTCAGGGCCAGGGCTC 

CAGCAAGCCCAGGGCAAGGATCCTGTGCTGCTGTCTGGTTGAGAGCCTGGCACCGTGTGTCG 

GGAGTGTGGGCCAGGCTGAGTGCATAGGTGACAGGGCCGTGAGCATGGGCCTGGGTGTGTGT 

GAGCTCAGGCCTAGGTGCGCAGTGTGGAGACGGGTGTTGTCGGGGAAGAGGTGTGGCTTCAA 

AGTGTGTGTGTGCAGGGGGTGGGTGTGTTAGCGTGGGTTAGGGGAACGTGTGTGCGCGTGCT 

GGTGGGCATGTGAGATGAGTGACTGCCGGTGAATGTGTCCACAGTTGAGAGGTTGGAGCAGG 

ATGAGGGAA'jTCCTGTCACCATCAATAATCACTTGTGGAGCGCCAGCTCTGCCCAAGACGCCA 

CCTGGGCGGACAGCCAGGAGCTCTCCATGGCCAGGCTGCCTGTGTGCATGTTCCCTGTCTGG 

TGCCCCTTTGCCCGCCTCCTGCAAACC± CACAGGGTCCCCACACAACAGTGCCCTCCAGAAG 

CAGGCCCTCGGAGGCAGAGGAAGGAAAATGGGGATGGCTGGGGCTCTCTCCATCCTCCTTTT 

CTCCTTGCCTTCGCATGGCTGGCCTTCCCCTCCAAAAGCTCCATTCCCCTGCTGCCAGCCCC 

TTTGCCATAGCCTGATTTTGGGGAGGAGGAAGGGGCGATTTGAGGGAGAAGGGGAGAAAGCT 

TATGGCTGGGTCTGGTTTCTTCCCTTCCCAGAGGGTCTTACTGTTCCAGGGTGGCCCCAGGG 

CAGGCAGGGGCCACACTATGCCTGTGCCCTGGTAAAGGTGACCCCTGCCATTTACGAGCAGC 

CCTGGCATGTTCCTGCCCCACAGGAATAGAATGGAGGGAGCTCCAGAAACTTTCCATCCCAA 

AGGCAGTCTCCGTGGTTGAAGCAGACTGGATTTTTGCTCTGCCCCTGACCGCTTGTCCCTCT 

TTGAGGG AGGGG AGCT ATGCTAGGACTC CAAC CT CAGGGACT CGGGTGGCCTGCGCTAGCTT 

CTTTTGATACTGAAAACTTTTAAGGTGGGAGGGTGGCAAGGGATGTGCTTAATAAATCAATT 

CCAAGCCTCAAAAAAAAAAAAAAAAA 
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FIGURE 216 

MAGL AARLVLLAGAAAIiASGSQGDREP VYRDC VLQCEE QNCS GG ALiNH FR S RQ P I YMS LAGW 
TCRDDCKYECMWVTVGLYLQEGHKVPQFHGKWPFSRFLFFQEPASAVASFLNGLASLVMLCR 
YRTFVPASSPMYHTCVAFAWVSLNAWFWSTVFHTRDTDLTEKMDYFCASTVILHSIYLCCVR 
TVGLQHPAVVSAFRAIjLLLMLTVHV^ 

RLPHVRKCWWLLLQGLSLLELLDFPPLFWVLDAHAIWHISTIPVHVLFFSFLEDDSLYLL 
KESEDKFKLD 
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FIGURE 217 

GGCCGCCTGGAATTGTGGGAGTTGTGTCTGCCACTCGGCTGCCGGAGGCCGAAGGTCCGTGA 
4 CTATGGCTCCCCAGAGCCTGCCTTCATCTAGGATGGCTCCTCTGGGCATGCTGCTTGGGCTG 
CTGATGGCCGCCTGCTTCACCTTCTGCCTCAGTCATCAGAACCTGAAGGAGTTTGCCCTGAC 
CAACCCAGAGAAGAGCAGCACCAAAGAAACGGAGAGAAAAGAAACCAAAGCCGAGGAGGAGC 

i TGGATGCCGAAGTCCTGGAGGTGTTCCACCCGACGCATGAGTGGCAGGCGCTTCAGCCAGGG 

I 

CAGGCTGTCCCTGCAGGATCCCACGTACGGCTGAATCTTCAGACTGGGGAAAGAGAGGCAAA 
ACTCCAATATGAGGACAAGTTCCGAAATAATTTGAAAGGCAAAAGGCTGGATATCAACACCA 
ACACCTACACATCTCAGGATCTCAAGAGTGCACTGGCAAAATTCAAGGAGGGGGCAGAGATG 
GAGAGTTCAAAGGAAGACAAGGCAAGGCAGGCTGAGGTAAAGCGGCTCTTCCGCCCCATTGA 
GGAACTGAAGAAAGACTTTGATGAGCTGAATGTTGTCATTGAGACTGACATGCAGATCATGG 
TACGGCTGATCAACAAGTTCAATAGTTCCAGCTCCAGTTTGGAAGAGAAGATTGCTGCGCTC 
TTTGATCTTGAATATTATGTCCATCAGATGGACAATGCGCAGGACCTGCTTTCCTTTGGTGG 
H TCTTCAAGTGGTGATCAATGGGCTGAACAGCACAGAGCCCCTCGTGAAGGAGTATGCTGCGT 
^ TTGTGCTGGGCGCTGCCTTTTCCAGCAACCCCAAGGTCCAGGTGGAGGCCATCGAAGGGGGA 
GCCCTGCAGAAGCTGCTGGTCATCCTGGCCACGGAGCAGCCGCTCACTGCAAAGAAGAAGGT 
CCTGTTTGCACTGTGCTCCCTGCTGCGCCACTTCCCCTATGCCCAGCGGCAGTTCCTGAAGC 
TCGGGGGGCTGCAGGTCCTGAGGACCCTGGTGCAGGAGAAGGGCACGGAGGTGCTCGCCGTG 
CGCGTGGTCACACTGCTCTACGACCTGGTCACGGAGAAGATGTTCGCCGAGGAGGAGGCTGA 
GCTGACCCAGGAGATGTCCCCAGAGAAGCTGCAGCAGTATCGCCAGGTACACCTCGTGCCAG 
GCCTGTGGGAACAGGGCTGGTGCGAGATCACGGCCCACCTCCTGGCGCTGCCCGAGCATGAT 
GCCCGTGAGAAGGTGCTGCAGACACTGGGCGTCCTCCTGACCACCTGCCGGGACCGCTACCG 
TCAGGACCCCCAGCTGGGCAGGACACTGGCCAGCCTGCAGGCTGAGTACCAGGTGCTGGCCA 
^ GCCTGGAGCTG?AGGATGGTGAGGACGAGGGCTACTTCCAGGAGCTGCTGGGCTCTGTCAAC 
AGCTTGCTGAAGGAGCTGAGATGAGGCCCCACACCAGGACTGGACTGGGATGCCGCTAGTGA 
GGCTGAGGGGTGCCAGCGTGGGTGGGCTTCTCAGGCAGGAGGACATCTTGGCAGTGCTGGCT 
TGGCCATTAAATGGAAACCTGAAGGCCAAAAAAAAAA^^^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 
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FIGURE 218 

MAPQSLPSSRHS\PLGMLLGLLMAACFTFCLSHQHLKEFALTNPEKSSTKETERKETKAEEEL 
DAEVLEVFHPTHEWQALQPGQAVPAGSHVRLNLQTGEREAKLQYEDKFRNNLKGKRLDINTO 
TYTSQDLKSALAKFKEGAEMESSKEDKARQAEVKRLFRPIEELKKDFDEL3STWIETDMQI1W 
RLINKFNSSSSSLEEKIAALFDLEYYVHQMDNAQDLLSFGGLQWINGLNSTEPLVKEYAAF 
VLGAAFSSNPKVQVEAIEGGALQKL.LVILATEQPLTAKKKVLiFALCSLLRHFPYAQRQFIiKL 
GGLQ VLRTLVQE KGTE VLAVRWTLL YDLVTE KMFAE EE AELTQEMS PE KLQQ YRQVHLL PG 
LWEQGWCEITAHLLAL.PEHDAREKVLQTLGVLLTTCRDRYRQDPQLGRTLASLQAEYQVLAS 
LELQDGEDEGYFQELLGSVNSLLKELR 
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FIGURE 219 

TTCGGCTTCCGTAGAGGAAGTGGCGCGGACCTTCATTTGGGGTTTCGGTTCCCCCCCTTCCC 
CTTCCCCGGGGTCTGGGGGTGACATTGCACCGCGCCCCTCGTGGGGTCGCGTTGCCACCCCA 
CGCGGACTCCCCAGCTGGCGCGCCCCTCCCATTTGCCTGTCCTGGTCAGGCCCCCACCCCCC 
TTCCCACCTGACCAGCCATGGGGGCTGCGGTGTTTTTCGGCTGCACTTTCGTCGCGTTCGGC 
CCGGCCTTCGCGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGT 
CGCAGGGGCATTTTTCTGGCTGGTCTCCCTGCTCCTGGCCTCTGTGGTCTGGTTCATCTTGG 
TCCATGTGACCGACCGGTCAGATGCCCGGCTCCAGTACGGCCTCCTGATTTTTGGTGCTGCT 
GTCTCTGTCCTTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGA 
TGAAGGGTTAGCATCGCTGAGTGAGGACGGAAGATCACCCATCTCCATCCGCCAGATGGCCT 
ATGTTTCTGGTCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAATATTTTGGCT 
GATGCACTTGGGCCAGGTGTGGTTGGGATCCATGGAGACTCACCCTATTACTTCCTGACTTC 
AGCCTTTCTGACAGCAGCCATTATCCTGCTCCATACCTTTTGGGGAGTTGTGTTCTTTGATG 
CCTGTGAGAGGAGACGGTACTGGGCTTTGGGCCTGGTGGTTGGGAGTCACCTACTGACATCG 
GGACTGACATTCCTGAACCCCTGGTATGAGGCCAGCCTGCTGCCCATCTATGCAGTCACTGT 
TTCCATGGGGCTCTGGGCCTTCATCACAGCTGGAGGGTCCCTCCGAAGTATTCAGCGCAGCC 
TCTTGTGTAAGGACTGACTACCTGGACTGATCGCCTGACAGATCCCACCTGCCTGTCCACTG 
CCCATGACTGAGCCCAGCCCCAGCCCGGGTCCATTGCCCACATTCTCTGTCTCCTTCTCGTC 
GGTCTACCCCACTACCTCCAGGGTTTTGCTTTGTCCTTTTGTGACCGTTAGTCTCTAAGCTT 
TACCAGGAGCAGCCTGGGTTCAGCCAGTCAGTGACTGGTGGGTTTGAATCTGCACTTATCCC 
CACCACCTGGGGACCCCCTTGTTGTGTCCAGGACTCCCCCTGTGTCAGTGCTCTGCTCTCAC 
CCTGCCCAAGACTCACCTCCCTTCCCCTCTGCAGGCCGACGGCAGGAGGACAGTCGGGTGAT 
GGTGTATTCTGCCCTGCGCATCCCACCCGAGGACTGAGGGAACCTAGGGGGGACCCCTGGGC 
CTGGGGTGCCCTCCTGATGTCCTCGCCCTGTATTTCTCCATCTCCAGTTCTGGACAGTGCAG 
GTTGCCAAGAAAAGGGACCTAGTTTAGCCATTGCCCTGGAGATGAAATTAATGGAGGCTCAA 
GGATAGATGAGCTCTGAGTTTCTCAGTACTCCCTCAAGACTGGACATCTTGGTCTTTTTCTC 
AGGCCTGAGGGGGAACCATTTTTGGTGTGATAAATACCCTAAACTGCCTTTTTTTCTTTTTT 
GAGGTGGGGGGAGGGAGGAGGTATATTGGAACTCTTCTAACCTCCTTGGGCTATATTTTCTC 
TCCTCGAGTTGCTCCTCATGGCTGGGCTCATTTCGGTCCCTTTCTCCTTGGTCCCAGACCTT 
GGGGGAAAGGAAGGAAGTGCATGTTTGGGAACTGGCATTACTGGAACTAATGGTTTTAACCT 
CCTTAACCACCAGCATCCCTCCTCTCCCCAAGGTGAAGTGGAGGGTGCTGTGGTGAGCTGGC 
CA CT CC AGAG CTGC AGTGCC AC TGGAGGAGTC AGACT AC C ATGACAT C GT AGGGAAGG AGGG 
GAGATTTTTTTGTAGTTTTTAATTGGGGTGTGGGAGGGGCGGGGAGGTTTTCTATAAACTGT 
ATCATTTTCTGCTGAGGGTGGAGTGTCCCATCCTTTTAATCAAGGTGATTGTGATTTTGACT 
AATAAAAAAGAATTTGTAAAAAAAAAA ^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 220 

MGAAVFFGCTFVAFGPAFALFLITVAGDPLRVIILVAGAFFWLVSLLLASVVWFILVHVTDR 
SDARLQYGLL I FGAAVSVLLQEVFRFAYYKLLKKADEGLASLSEDGRS PIS IRQMAYVSGLS 
FG 1 1 SGVFS V INI LAD ALGP GVVG I HGD S P Y Y FLTS AF LT AA 1 1 LLHT FWGWF FDACERRR 
YWALGLWGSHLLTSGLTFLNPWYEASLLPIYAVTVSMGLWAFITAGGSLRSIQRSLLCKD 
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FIGURE 221 



AAGCTGGTTTAAGGAAGCAGAGGAGGGTTAGATTCGTTGAGTGAGGACGGAAGATCAACCCA 
TTTCCATTCCGCCAGATGGCCTATGTTTCTGGTCTCTCCCTTCGGNATCATCAGTGGTGTNT 
TNTCTGTTATCAATATTTTGGCTGATGCANTTGGGCCAGGTGTGGTTGGGATCCATGGAGAC 
TCACCCTATTANTTCCTGANTTCAGCCTTTNTGACAGCAGCCATTATCCTGCTC 
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FIGURE 222 

gaccgaccgttcagatgcccggttccagtacggcttcctgatttttggtgctgctgtntctg 
tccttctacaggaggtgttccgctttgcctantacaagctgcttaagaaggcagatgagggg 
ttagcatngctgagtgaggacggaagatcacccatttccatccgccagatggcctatgtttn 
tggtntttccttcggtatcatcagtggtgttttntctgttatcaatattttggntgatgcan 
ttgggccaggtgtggttgggatccatggagantcacc*:tattaattcctgaattcagccttt 
ntgacagcagccattatcctgntccataccttttggg^agttgtgttttttgatgcctgtga 

GAGGAG 
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FIGURE 223 

NGTTGGAGAAGTGGCGCGGACNTTCATTTGGGGTTTCGGTTTCCCCCCTTTCCCTTTCCCCG 
GGGTCTGGGGTGACATTGCACGGGCCCCTCGTGGGGTCGCGTTGCCACCCCACGCGGACTCC 
CCAGNTGGNGCGCCCTTCCCATTTGCCTGTCCTGGTCAGGCCCCCACCCCCCTTCCCACNTG 
ACCAGCCATGGGGGCTGCGGTGTTTTTCGGCTGCACTTTCGTCGCGTTCGGCCCGGCCTTCG 
i CGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGTCGCAGGGGCA 

i 

TTTTTCTGGCTGGTCTCCCTGCTCCTGGCCTCTGTGGTCTGGTTCATCTTGGTCCATGTGAC 
CGACCGGTCAGATGGCCGGCTCCAGTACGGCCTCCTGATTTTTGGTGCTGCTGTCTCTGTCC 
TTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGATGAGGGGTTA 
GCATCGCTGAGTGAGGACGGAAGATCACCCATCTCCATCCGCCAGATGGCCTATGTTTCTGG 
TCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAATATTTTGGCTGATGCACTTG 
GGCCAGGTGTGGTTGGGATCCATGGAGACTCACCC 

M 
% 
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FIGURE 224 

GTAAAAGAAAGTGGCCGGACCTTCATTGGGGTTTCGGTTCCCCCCTTTCCCNTTCCCCGGGG 
TCTGGGGGTGACATTGCACCGCGCCCNTCGTGGGGTCGCGTTGCCACCCCACGCGGACTCCC 
CAGNTGGCGCGCCCCTCCCATTTGCCTGTCCTGGTCAGGCCCCCACCCCCCTTCCCACCTGA 
CCAGCCATGGGGGCTGCGGTGTTTTTCGGGCTGCACTTTCGTCGCGTTCGGGCCCGGCCTTC 
GCGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGTCGCAGGGGC 
ATTTTTCTGGCTGGTCTCCCTGCTCCTGGCCTCTGTGGTCTGGTTCATCTTGGTCCATGTGA 
CCGACCGGTCAGATGCCCGGCTCCAGTACGGCCTCCTGATTTTTGGTGCTGCTGTCTCTGTC 
CTTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGATGAGGGGTT 
AGCATCGCTGAGTGAGGACGGAAGATCACCCATCTCCATCCGCCAGATGGCCTATGTTTCTG 
GTCTCTCGTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAATATTTTGGCTGATGCAGTT 
GGGCCAGGTGTGGTTGGGATCCATGGAGAC 



2?& 
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GCCCCAGGGAGCAGTGGGTGGTTATAACTCAGGCCCGGTGCCCAGAGCCCAGGAGGAGGCAG 
TGGCCAGGAAGGCACAGGCCTGAGAAGTCTGCGGCTGAGCTGGGAGCAAATCCCCCACCCCC 
TACCTGGGGGACAGGGCAAGTGAGACCTGGTGAGGGTGGCTCAGCAGGCAGGGAAGGAGAGG 
TGTCTGTGCGTCCTGCACCCACATCTTTCTCTGTCCCCTCCTTGCCCTGTCTGGAGGCTGCT 

| AGACTCCTATCTTCTGAATTCTATAGTGCCTGGGTCTCAGCGCAGTGCCGATGGTGGCCCGT 

I 

- CCTTGTGGTTCCTCTCTACCTGGGGAAATAAGGTGCAGCGGCCATGGCTACAGCAAGACCCC 
CCTGGATGTGGGTGCTCTGTGCTCTGATCACAGCCTTGCTTCTGGGGGTCACAGAGCATGTT 
CTCGCCAACAATGATGTTTCCTGTGACCACCCCTCTAACACCGTGCCCTCTGGGAGCAACCA 

- GGACCTGGGAGCTGGGGCCGGGGAAGACGCCCGGTCGGATGACAGCAGCAGCCGCATCATCA 
ATGGATCCGACTGCGATATGCACACCCAGCCGTGGCAGGCCGCGCTGTTGCTAAGGCCCAAC 
CAGCTCTACTGCGGGGCGGTGTTGGTGCATCCACAGTGGCTGCTCACGGCCGCCCACTGCAG 
GAAGAAAGTTTTCAGAGTCCGTCTCGGCCACTACTCCCTGTCACCAGTTTATGAATCTGGGC 

|| AGCAGATGTTCCAGGGGGTCAAATCCATCCCCCACCCTGGCTACTCCCACCCTGGCCACTCT 
AACGACCTCATGCTCATCAAACTGAACAGAAGAATTCGTCCCACTAAAGATGTCAGACCCAT 
CAACGTCTCCTCTCATTGTCCCTCTGCTGGGACAAAGTGCTTGGTGTCTGGCTGGGGGACAA 
CCAAGAGCCCCCAAGTGCACTTCCCTAAGGTCCTCCAGTGCTTGAATATCAGCGTGCTAAGT 
CAGAAAAGGTGCGAGGATGCTTACCCGAGACAGATAGATGACACCATGTTCTGCGCCGGTGA 
CAAAGCAGGTAGAGACTCCTGCCAGGGTGATTCTGGGGGGCCTGTGGTCTGCAATGGCTCCC 
TGCAGGGACTCGTGTCCTGGGGAGATTACCCTTGTGCCCGGCCCAACAGACCGGGTGTCTAC 
ACGAACCTCTGC7VAGTTCACCAAGTGGATCCAGGAAACCATCCAGGCCAACTCCTGAGTCAT 
CCGAGGACTCAGCACACCGGCATCCCCACCTGCTGCAGGGACAGCCCTGAGACTCCTTTCAG 
ACCCTCATTCCTTCCCAGAGATGTTGAGAATGTTCATCTCTCCAGCCCCTGACCCCATGTCT 

^ CCTGGACTCAGGG^CTGCTTCCCCCACATTGGGCTGACCGTGTCTCTCTAGTTGAACCCTGG 
GAACAATTTCCAAAACTGTCCAGGGCGGGGGTTGCGTCTCAATCTCCCTGGGGCACTTTCAT 
CCTCAAGCTCAGGGCCCATCCCTTCTCTGCAGCTCTGACCCAAATTTAGTCCCAGAAATAAA 
CTGAGAAGTGGAAAAAAAAA 
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FIGURE 226 

^TARPPVMWVLCALITALLLGVTEHVLANNDVSCDHPSNTVPSGSNQDLGAGAGEDARSDD 
SSSRIINGSDCDMHTQPWQAALLLRPNQLYCGAVLVHPQWLLTAAHCRKKVFRVRLGHYSLS 
PVYESGQQMFQGVKSIPHPGYSHPGHSNDLMLIKLNRRIRPTKDVRPINVSSHCPSAGTKCL 
VSGWGTTKSPQVHFPKVLQCLNISVLSQKRCEDAYPRQ IDDTMFCAGDKAGRDSCQGDSGGP 

.| WCNGSLQGLVSWGDYPCARPNRPGVYTNLCKFTKWIQETIQANS 

I 
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FIGURE 227 

ATGGTCAACGACCGGTGGAAGACCATGGGCGGCGCTGCCCAACTTGAGGACCGGCCGCGCGA 
CAAGCCGCAGCGGCCGAGCTGCGGCTACGTGCTGTGCACCGTGCTGCTGGCCCTGGCTGTGC 
TGCTGGCTGTAGCTGTCACCGGTGCCGTGCTCTTCCTGAACCACGCCCACGCGCCGGGCACG 
GCGCCCCCACCTGTCGTCAGCACTGGGC^-lGCCAGCGCCAACAGCGCCCTGGTCACTGTGGA 
^ AAGGGCGGACAGCTCGCACCTCAGCATCCTCATTGACCCGCGCTGCCCCGACCTCACCGACA 
GCTTCGCACGCCTGGAGAGCGCCCAGGCCTCGGTGCTGCAGGCGCTGACAGAGCACCAGGCC 
CAGCCACGGCTGGTGGGCGACCAGGAGCAGGAGCTGCTGGACACGCTGGCCGACCAGCTGCC 
M CCGGCTGCTGGCCCGAGCCTCAGAGCTGCAGACGGAGTGCATGGGGCTGCGGAAGGGGCATG 
f GCACGCTGGGCCAGGGCCTCAGCGCCCTGCAGAGTGAGCAGGGCCGCCTCATCCAGCTTCTC 
^ TCTGAGAGCCAGGGCCACATGGCTCACCTGGTGAACTCCGTCAGCGACATCCTGGATGCCCT 
GCAGAGGGACCGGGGGCTGGGCCGGCCCCGCAACAAGGCCGACCTTCAGAGAGCGCCTGCCC 
GGGGAACCCGGCCCCGGGGCTGTGCCACTGGCTCCCGGCCCCGAGACTGTCTGGACGTCCTC 
CTAAGCGGACAGCAGGACGATGGCGTCTACTCTGTCTTTCCCACCCACTACCCGGCCGGCTT 
' CCAGGTGTACTGTGACATGCGCACGGACGuOGGCGGCTGGACGGTGTTTCAGCGCCGGGAGG 
ACGGCTCCGTGAACTTCTTCCGGGGCTGGGACGCGTACCGAGACGGCTTTGGCAGGCTCACC 
GGGGAGCACTGGCTAGGGCTCAAGAGGATCCACGCCCTGACCACACAGGCTGCCTACGAGCT 
GCACGTGGACCTGGAGGACTTTGAGAATGGC^CGGCCTATGCCCGCTACGGGAGCTTCGGCG 
TGGGCTTGTTCTCCGTGGACCCTGAGGAAGACGGGTACCCGCTCACCGTGGCTGACTATTCC 
GGCACTGCAGGCGACTCCCTCCTGAAGCACAGCGGCATGAGGTTCACCACCAAGGACCGTGA 
CAGCGACCATTCAGAGAACAACTGTGCCGCCTTCTACCGCGGTGCCTGGTGGTA.CCGCAACT 
GCCACACGTCCAACCTCAATGGGCAGTACCTGCGCGGTGCGCACGCCTCCTATGCCGACGGC 
GTGGAGTGGTCCTCCTGGACCGGCTGGCAGTACTCACTCAAGTTCTCTGAGATGAAGATCCG 
M GCCGGTCCGGGAGGACCGCTAGACTGGTGCACCTTGTCCTTGGCCCTGCTGGTCCCTGTCGC 
5@S CCCATCCCCGACCCCACCTCACTCTTTCGTGAATGTTCTCCACCCACCTGTGCCTGGCGGAC 
if CCACTCTCCAGTAGGGAGGGGCCGGGCCATCCCTGACACGAAGCTCCCTGGGCCGGTGAAGT 
CACACATCGCCTTCTCGCCGTCCCCACCCCCTCCATTTGGCAGCTCACTGATCTCTTGCCTC 
TGCTGATGGGGGCTGGCAAACTTGACGACCCCAACTCCTGCCTGCCCCCACTGTGACTCCGG 
TGCTGTTTGCCGTCCCCTGGCCAGGATGGTGGAGTCTGCCCCAGGCACCCTCTGCCCTGCCC 
GGCCAAATACCCGGCATTATGGGGACAGAGAGCAGGGGGCAGACAGCACCCCTGGAGTCCTC 
CTAGCAGATCGTGGGGAATGTCAGGTCTCTCTGAGGTCAGGTCTGAGGCCAGTATCCTCCAG 
CCCTCCCAATGCCAACCCCCACCCCGTTTCCCTGGTGCCCAGAGAACCCACCTCTCCCCCAA 
GGGCCTCAGCCTGGCTGTGGGCTGGGTGGCCCCATCCTACCAGGCCCTGAGGTCAGGATGGG 
GAGCTGCTGCCTTTGGGGACCCACGCTCCAAGGCTGAGACCAGTTCCCTGGAGGCCACCCAC 
CCTGTGCCCCGGCAGGCCTGGGGTCTGCAGTCCTCTTACCTGCTGTGCCCACCTGCTCTCTG 
TCTCAAATGAGGCCCAACCCATCCCCCACCCAGCTCCCGGCCGTCCTCCTACCTGGGGCAGC 
CGGGGCTGCCATCCCATTTCTCCTGCCTCTGGAAGGTGGGTGGGGCCCTGCACCGTGGGGCT 
GGACTGCGCTAATGGGAAGCTCTTGGTTTTCTGGGCTGGGGCCTAGGCAGGGCTGGGATGAG 
GCTTGTACAACCCCCACCACGAATTTCCCAGGGACTCCAGGGTCCTGAGGCCTCCCAGGAGG 
GCCTTGGGGGTGATGACCCGTTCCCTGAGGTGGCTGTCTCCATGAGGAGGCCAACCCTTGCC 
ATTGACCGTGGCCACCTGGACCCAGGCCAGGCCCGGCCCGGCGAGTGGTCAAGGGACAGGGA 
CCACCTCACCGGGCAAATGGGGTCGGGGGGACTGGGGCACCAGACCAGGCACCACCTGGACA 
CTTTCTTGTTGAATCCTCCCAACACCCAGCACGCTGTCATCCCCACTCCTTGTGTGCACACA 
m TGCAGAGGTGAGACCCGCAGGCTCCCAGGACCAGCAGCCACAAGGGCAGGGCTCjGAGCCGGG 
TCCTCAGCTGTCTGCTCAGCAGCCCTGGACCCGCGTGCGTTACGTCAGGCCCAGATGCAGGG 
CGGCTTTTCCAAGGCCTCCTGATGGGGGCCTCCGAAAGGGCTGGAGTCAGCCTTGGGGAGCT 
GCCTAGCAGCCTCTCCTCGGGCAGGAGGGGAGGTGGCTTCCTCCAAAGGACACCCGATGGCA 
"1 GGTGCCTAGGGGGTGTGGGGTTCCGTTCTCCCTTCCCCTCCCACTGAAGTTTGTGCTTAAAA 
AACAATAAATTTGACTTGGCACCACTGGGGGTTGGTGGGAGAGGCCGTGTGACCTGGCTCTC 
TGTCCCAGTGCCACCAGGTCATCCACATGCGCAG 
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IVIVNDRWKTMGGAAQLEDRPRDKPQRPSCGYVLCTVLLALiAVLLAVAVTGAVLFLNHAHAPGT 
APPPWSTGAASANSALVTVERADSSHLSILIDPRCPDLTDSFARLESAQASVLQALTEHQA 
QPRLVGDQEQELLDTLADQLPRLLARASELQTECMGLRKGHGTLGQGLSALQSEQGRLIQLL 
SESQGHMAHLVNSVSDILDALQRDRGLGRPRNKADLQRAPARGTRPRGCATGSRPRDCLDVL 
LSGQQDDGVYSVFPTHYPAGFQVYCDMRTDGGGWTVFQRREDGSVNFFRGWDAYRDGFGRLT 
GEHWLGLKRIHALTTQAAYELHVDLEDFENGTAYARYGSFGVGLFSVDPEEDGYPLTVADYS 
GTAGDSLLKHSGMRFTTKDRDSDHSENNCAAFYRGAWWYRNCHTSNLNGQYLRGAHASYADG 
VE WS S WTGWQYS LKFSEMKIR PVREDR 
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FIGURE 229 

GCAGTCAGAGACTTCCCCTGCCCCTCGCTGGGAAAGAACATTAGGAATGCCTTTTAGTGCCT 
TGCTTCCTGAACTAGCTCACAGTAGCCCGGCGGCCCAGGGCAATCCGACCACATTTCACTCT 
CACCGCTGTAGGAATCCAGASSCAGGCCAAGTACAGCAGCACGAGGGACATGCTGGATGATG 
ATGGGGACACCACCATGAGCCTGCATTCTCAAGCCTCTGCCACAACTCGGCATC CAGAGCCC 
1 CGGCGCACAGAGCACAGGGCTCCCTCTTCAACGTGGCGACCAGTGGCCCTGACCCTGCTGAC 
%; TTTGTGCTTGGTGCTGCTGATAGGGCTGGCAGCCCTGGGGCTTTTGTTTTTTCAGTACTACC 
AGCTCTCCAATACTGGTCAAGACACCATTTCT CAAATGGAAGAAAGATTAGGAAATACGTCC 
CAAGAGTTGCAATCTCTTCAAGTCCAGAATATAAAGCTTGCAGGAAGTCTGCAGCATGTGGC 
TGAAAAACTCTGTCGTGAGCTGTATAACAAAGCTGGAGCACACAGGTGCAGCCCTTGTACAG 
AACAATGGAAATGGCATGGAGACAATTGCTACCAGTTCTATAAAGACAGCAAAAGTTGGGAG 
GACTGTAAATATTTCTGCCTTAGTGAAAACTCTACCATGCTGAAGATAAACAAACAAGAAGA 
CCTGGAATTTGCCGCGTCTCAGAGCTACTCTGAGTTTTTCTACTCTTATTGGACAGGGCTTT 
|| TGCGCCCTGACAGTGGCAAGGCCTGGCTGTGGATGGATGGAACCCCTTTCACTTCTGAACTG 
* TTCCATATTATAATAGATGTCACCAGCCCAAGAAGCAGAGACTGTGTGGCCATCCTCAATGG 
GATGATCTTCTCAAAGGACTGCAAAGAATTGAAG CGTTGTGT CTGTGAGAGAAGGGCAGGAA 
TGGTGAAGCCAGAGAGCCTCCATGTCCCCCCTGAAACATTAGGCGAAGGTGACTGATTCGCC 
CTCTGCAACTACAAATAGCAGAGTGAGCCAGGCGGTGCCAAAGCAAGGGCTAGTTGAGACAT 
TGGGAAATGGAACATAATCAGGAAAGACTATCTCTCTGACTAGTACAAAATGGGTTCTCGTG 
TTTCCTGTTCAGGATCACCAGCATTTCTGAGCTTGGGTTTATGCACGTATTTAACAGTCACA 
AGAAGTCTTATTTACATGCCACCAACCAACCTCAGAAACCCATAATGTCATCTGCCTTCTTG 
GCTTAGAGATAACTTTTAGCTCTCTTTCTTCTCAATGTCTAATATCACCTCCCTGTTTTCAT 
GTCTTCCTTACACTTGGTGGAATAAGAAACTTTTTGAAGTAGAGGAAATACATTGAGGTAAC 
m ATCCTTTTCTCTGACAGTCAAGTAGTCCATCAGAAATTGGCAGTCACTTCCCAG^TTGTACC 
AG C AAAT AC AC AAGG AATT CTTTTTGTTTGTTTC AGTT CATA CTAGTC C C TT C C CAAT C CAT 
CAGTAAAGACCCCATCTGCCTTGTCCATGCCGTTTCCCAACAGGGATGTCACTTGATATGAG 
AATCTCAAATCTCAATGCCTTATAAGCATTCCTTCCTGTGTCCATTAAGACTCTGATAATTG 
TCTCCCCTCCATAGGAATTTCTCCCAGGAAAGAAATATATCCCCATCTCCGTTTCATATCAG 
AACTACCGTCCCCGATATTCCCTTCAGAGAGATTAAAGACCAGAAAAAAGTGAGCCTCTTCA 
TCTGCACCTGTAATAGTTTCAGTTCCTATTTTCTTCCATTGACCCATATTTATACCTTTCAG 
^ GTACTGAAGATTTAATAATAATAAATGTAAATACTGTGAAAAA 
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FIGURE 230 

MQAKYSSTRDMLDDDGDTTMSLHSQASATTRHPEPRRTEHRAPSSTWRPVALTLLTLCLVLL 
IGLAALGLLFFQYYQLSNTGQDTISQMEERLGNTSQELQSLQVQNIKLAGSLQHVAEKLCRE 
LYNKAGAHRCSPCTEQWKWHGDNCYQFYKDSKSWEDCKYFCLSENSTMLKINKQEDLEFAAS 
QSYSEFFYSYWTGLLRPDSGKAWLWMDGTP FTSELFHI 1 1 DVTS PRSRDCVAI LNGMI FSKD 
CKELKRCVCERRAGMVKPESLHVPPETLGEGD 
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FIGURE 231 

AATTTTCACCGCTGTAGGAATCCAGATGCAGGCCAAGTACAGCAGCACGAGGGACATGNTGG 
ATGATGATGGGAC ACC AC CAT GAGC CTGC ATTNT C AAG C T TT TG C C ACAATT CGGCAT C C AG 
AGCCCCGGCGCACAGAGCACAGGGNTCCTTTTTCAACGTGGCGACCAGTGGCCCTGACCCTG 
CTGACTTTGTGCTTGGTGCTGCTGATAGGGCTGGCAGCCCTGGGGCTTTTGTTTTTTCAGTA 
CTACCAGCTCTCCAATACTGGTCAAGACACCATTTCTCAAATGGAAGAAAGATTAGGAAATA 
CGTCCCAAGAGTTGCAATTTNTTCAAGTCCAGAATATAAAGCTTGCAGGAAGTNTGCAGCAT 
GTGG CTGAAAAACT CTGT CG TG AGCTGT ATAACAAAG CTGG AGG AACTTTGAAGGAGGGC AA 
AG TNT C CT C ATNT ACT AT AC AC AC AC CACTTCCC 
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FIGURE 232 

GCCGAGCGCAAGAACCCTGCGCAGCCCAGAGCAGCTGCTGGAGGGGAATCGAGGCGCGGCTC 
CGGGGATTCGGCTCGGGCCGCTGGCTCTGCTCTGCGGGGAGGGAGCGGGCCCGCCCGCGGGG 
CCCGAGCCCTCCGGATCCGCCCCCTCCCCGGTCCCGCCCCCTCGGAGACTCCTCTGGCTGCT 
CTGGGGGTTCGCCGGGGCCGGGGACCCGCGGTCCGGGCGCCATGCGGGCATCGCTGCTGCTG 
i TCGGTGCTGCGGCCCGCAGGGCCCGTGGCCGTGGGCATCTCCCTGGGCTTCACCCTGAGCCT 
GCTCAGCGTCACCTGGGTGGAGGAGCCGTGCGGCCCAGGCCCGCCCCAACCTGGAGACTCTG 
AGCTGCCGCCGCGCGGCAACACCAACGCGGCGCGCCGGCCCAACTCGGTGCAGCCCGGAGCG 
| GAGCGCGAGAAGCCCGGGGCCGGCGAAGGCGCCGGGGAGAATTGGGAGCCGCGCGTCTTGCC 
i CTACCACCCTGCACAGCCCGGCCAGGCCGCCAAAAAGGCCGTCAGGACCCGCTACATCAGCA 
CGGAGCTGGGCATCAGGCAGAGGCTGCTGGTGGCGGTGCTGACCTCTCAGACCACGCTGCCC 
ACGCTGGGCGTGGCCGTGAACCGCACGCTGGGGCACCGGCTGGAGCGTGTGGTGTTGCTGAC 
GGGCGCACGGGGCCGCCGGGCCCCACCTGGCATGGCAGTGGTGACGCTGGGCGAGGAGCGAC 
CCATTGGACACCTGCACCTGGCGCTGCGCCACCTGCTGGAGCAGCACGGCGACGACTTTGAC 
TGGTTCTTCCTGGTGCCTGAGACCACCTACACCGAGGCGCACGGCCTGGCACGCCTAACTGG 
CCACCTCAGCCTGGCCTCCGCCGCCCACCTGTACCTGGGCCGGCCCCAGGACTTCATCGGCG 
GAGAGCCCACCCCCGGCCGCTACTGCCACGGAGGCTTTGGGGTGCTGCTGTCGCGCATGCTG 
CTGCAACAACTGCGCCCCGACCTGGAAGGCTGCCGCAACGACATCGTCAGTGCGCGCCCTGA 
CGAGTGGCTGGGTCGCTGCATTCTCGATGCCACCGGGGTGGGCTGCACTGGTGACCACGAGG 
GGGTGGACTATAGCCATCTGGAGCTGAGCCCTGGGGAGCCAGTGCAGGAGGGGGACCCTCAT 
TTCCGAAGTGCCCTGACAGCCCACCCTGTGCGTGACCCTGTGCACATGTACCAGCTGCACAA 
& AGCTTTCGCCCGAGCTGAACTGGAACGCACGTACCAGGAGATCCAGGAGTTACAGTGGGAGA 
f§ TCCAGAATACCAGCCATCTGGCCGTTGATGGGGACCGGGCAGCTGCTTGGCCGGTGGGTATT 
^ CCAGCACCATCCCGCCCGGCCTCCCGCTTTGAGGTGCTGCGCTGGGACTACTTCACGGAGCA 
GCACGCTTTCTCCTGCGCCGATGGCTCACCCCGCTGCCCACTGCGTGGGGCTGACCGGGCTG 
ATGTGGCCGATGTTCTGGGGACAGCTCTAGAGGAGCTGAACCGCCGCTACCACCCGGGCTTG 
CGGCTCCAGAAGCAGCAGCTGGTGAATGGCTACCGACGCTTTGATCCGGCCCGGGGTATGGA 
ATACACGCTGGACTTGCAGCTGGAGGCACTGACCCGCCAGGGAGGCCGCCGGCCCGTCACTC 
GCCGAGTGCAGCTGCTCCGGCCGCTGAGCCGCGTGGAGATCTTGCCTGTGCCCTATGTCACT 
GAGGCCTCACGTCTCACTGTGCTGCTGCCTCTAGCTGCGGCTGAGCGTGACCTGGCCCCTGG 
CTTCTTGGAGGCCTTTGCCACTGCAGCAGTGGAGCCTGGTGATGCTGCGGCAGCCCTGACCC 
TGCTGCTACTGTATGAGCCGCGCCAGGCCCAGCGCGTGGCCCATGCAGATGTCTTCGCACCT 
GTCAAGGCCCACGTGGCAGAGCTGGAGCGGCGTTTCCCCGGTGCCCGGGTGCCATGGCTCAG 
TGTGGAGACAGCCGGACCCTCACCACTGCGCCTCATGGATCTACTCTCCAAGAAGCACCCGC 
TGGACACACTGTTCCTGCTGGCCGGGCCAGACACGGTGCTCACGCCTGACTTCCTGAACCGC 
TGCCGCATGCATGCCATCTCCGGCTGGCAGGCCTTCTTTCCCATGCATTTCCAAGCCTTCCA 
CCCAGGTGTGGCCCCACCACAAGGGCCTGGGCCCCCAGAGCTGGGCCGTGACACTGGCCGCT 
TTGATCGCCAGGCAGCCAGCGAGGCCTGCTTCTACAACTCCGACTACGTGGCAGCCCGTGGG 
CGCCTGGCGGCAGCCTCAGAACAAGAAGAGGAGCTGCTGGAGAGCCTGGATGTGTACGAGCT 
GTTCCTCCACTTCTCOAGTCTGCATGTGCTGGGGGCGGTGGAGCCGGCGCTGCTGCAGCGCT 
ACCGGGCCCAGACGTGCAGCGCGAGGCTCAGTGAGGACCTGTACCACCGCTGCCTCCAGAGC 
GTGCTTGAGGGCCTCGGCTCCCGAACGCAGCTGGCCATGCTACTCTTTGAACAGGAGCAGGG 
% CAACAGCACCTGACCCCACCCTGTCCCCGTGGGCCGTGGCATGGGCACACCCCACCCGACTT 
CTCCCCCAAAACCAGAGCCACCTGCCAGCCTCGCTGGGCAGGGCTGGCCGTAGCCAGACCCC 
AAGCTGGCCCAGTGGTCCCCTGTCTGGCTCTGTGGGTCCCTGGGCTCTGGACAAGCACTGGG 
GGAGGTGCCGCCAGAGCCACCCACTTCTCATCCCAAACCCAGTTTCCCTGCCCCGTGACGCT 
GCTGATTCGGGCTGTGGCGTCCAGGTATTTATGCAGTACAGTCTGCCTGACGCCAGCCGTGC 
CTCTGGGCCCTGGGGGGTGGGCTGTAGAAGAGTTGTTGGGGAAGGAGGGAGCTGAGGAGGGG 
GCATCTCCCAACTTCTCCCTTTTGGACCCTGCCGAAGCTCCCTGCCTTTAATAAACTGGCCA 
AGTGTGGAAAAA 



BNSDOC1D: <WO 9963088 A2 I. _> 



ft 



WO 99/63088 / PCT/US99/12252 

FIGURE 233 

MRASLLLSVLRPAGPVAVGISLGFTLSLLSVTWVEEPCGPGPPQPGDSEliPPRGNTWAARRP 
NSVQPGAEREKPGAGEGAGENWEPRVLPYHPAQPGQAAKKAVRTRYISTELGIRQRLLVAVL 
TSQTTLPTLGVAWRTLGHRLERWFLTGARGRRAPPG^WTLGEERPIGHLHLALRHLLE 
QHGDDFDWFFLVPDTTYTEAHGLARLTGHLSLASAAHLYLGRPQDFIGGEPTPGRYCHGGFG 
I VLLSRMLIiQQLRPKLEGCRNDIVSARPDEWLGRCILDATGVGCTGDHEGVHYSHLELSPGEP 
VQEGDPHFRSALTAHPVRDPVHMYQLHKAFARAELERTYQEIQELQWEIQNTSHLAVDGDRA 
AAWPVGIPAPSRPASRFEVLRWDYFTEQHAFSCADGSPRCPLRGADRADVADVLGTALEELN 
RRYHPALRLQKQQLVNGYRRFDPARGMEYTLDLQLEALTPQGGRRPLTRRVQLLRPLSRVEI 
w LPVPYVTEASRLTVLLPLAAAERDLAPGFLEAFATAALEPGDAAAALTLLLLYEPRQAQRVA 
HADVFAP VKAHVAELERRF PGARVPWLSVQTAAP S PLRLMDLLS KKHPLDTLFLLAGPDT VL 
TPDFLNRCRMHAISGWQAFFPMHFQAFHPGVAPPQGPGPPELGRDTGRFDRQAASEACFYNS 
DYVAARGRLAAASEQEEELLESLDVYELFLHFSSLHVLRAVEPALLQRYRAQTCSARLSEDL 
YHRCLQSVLEGLGSRTQLAMLtLFEQEQGNST 
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GCTCTGGCCGGCCCCGGCGATTGGTCACCGCCCGCTAGGGGACAGCCCTGGCCTCCTCTGAT 
TGGCAAGCGCTGGCCACCTCCCCACACCCCTTGCGAACGCTCCCCTAGTGGAGAAAAGGAGT 
AGCTATTAGCCAATTCGGCAGGGCCCGCTTTTTAGAAGCTTGATTTCCTTTGAAGATGAAAG 
^ ACTAGCGGAAGCTCTGCCTCTTTCCCCAGTGGGCGAGGGAACTCGGGGCGATTGGCTGGGAA 
^ CTGTATCCACCCAAATGTCACCGATTTCTTCCTATGCAGGAAATGAGCAGACCCATCAATAA 
■| GAAATTTCTCAGCCTGGCCGAAAATGGTTGGCCCCACGAAGCCACGACAACTGGAGGCAAAG 
AGGGTTGCTCAACGCCCCGCCTCATTGGAAAACCAAATCAGATCTGGGACCTATATAGCGTG 
GCGGAGGCGGGGCGATGATTGTCGCGCTCGCACCCACTGCAGCTGCGCACAGTCGCATTTCT 
TTCCCCGCCCCTGAGACCCTGCAGCACCATCTGTC ATG GCGGCTGGGCTGTTTGGTTTGAGC 
GCTCGCCGTCTTTTGGCGGCAGCGGCGACGCGAGGGCTCCCGGCCGCCCGCGTCCGCTGGGA 
ATCTAGCTTCTCCAGGACTGTGGTCGCCCCGTCCGCTGTGGCGGGAAAGCGGCCCCCAGAAC 
CGACCACACCGTGGCAAGAGGACCCAGAACCCGAGGACGAAAACTTGTATGAGAAGAACCCA 
GACTCCCATGGTTATGACAAGGACCCCGTTTTGGACGTCTGGAACATGCGACTTGTCTTCTT 
CTTTGGCGTCTCCATCATCCTGGTCCTTGGCAGCACCTTTGTGGCCTATCTGCCTGACTACA 
^ GGATGAAAGAGTGGTCCCGCCGCGAAGCTGAGAGGCTTGTGAAATACCGAGAGGCCAATGGC 
CTTCCCATCATGGAATCCAACTGCTTCGACCCCAGCAAGATCCAGCTGCCAGAGGATGAGTG 
ACCAGTTGCTAAGTGGGGCTCAAGAAGCACCGCCTTCCCCACCCCCTGCCTGCCATTCTGAC 
CTCTTCTCAGAGCACCTAATTAAAGGGGCTGAAAGTCTGAA 
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MAAGLFGLSARRLLAAAATRGLPAARVRWESSFSRTWAPSAVAGKRPPEPTTPWQEDPEPE 
DENLYEKNPDSHGYDKDPVLDVWNMRLVFFFGVS 1 1 LVLGSTFVAYLPDYRMKEWSRREAER 
LVKYREANGLPIMESNCFDPSKIQLPEDE 



1 
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GGCGGCTGGGCTGTTTGGTTTGAGCGCTCGCCGTCTTTTGGCGGCAGCGGCGACGCGAGGGC 
TCCCGGCCGCCCGCGTCCGCTGGGAATCTAGCTTCTCCAGGACTGTGGTCGCCCCGTCCGCT 
GTGGCGGGAAAGCGGCCCCCAGAACCGACCACACCGTGGCAAGAGGACCCAGAACCCGAGGA 
CGAAAACTTGTATGAGAAGAACCCAGACTCCCATGGTTATGACAAGGACCCCGTTTTGGACG 
TCTGGAACATGCGACTTGTCTTCTTCTTTGGCGTCTCCATCATCCTGGTCCTTGGCAGCACC 
TTTGTGGCCTATCTGCCTGACTACAGGATGAAAGAGTGGTCCCGCCGCGAAGCTGAGAGGCT 
TGTGAAATACCGAGAGGCCAATGGCCTTCCCATCATGGAATCCAACTGCTTCGACCCCAGCA 
AGATCCAG 
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GCGGCGGCTATGCCGCTTGCTCTGCTCGTCCTGTTGCTCCTGGGGCCCGGCGGCTGGTGCCT 
TGCAGAACCCCCACGCGACAGCCTGCGGGAGGAACTTGTCATCACCCCGCTGCCTTCCGGGG 
ACGTAGCCGCCACATTCCAGTTCCGCACGCGCTGGGATTCGGAGCTTCAGCGGGAAGGAGTG 
, v; . TCCCATTACAGGCTCTTTCCCAAAGCCCTGGGGCAGCTGATCTCCAAGTATTCTCTACGGGA 
GCTGCACCTGTCATTCACACAAGGCTTTTGGAGGACCCGATACTGGGGGCCACCCTTCCTGC 
AGGCCCCATCAGGTGCAGAGCTGTGGGTCTGGTTCCAAGACACTGTCACTGATGTGGATAAA 
* TCTTGGAAGGAGCTCAGTAATGTCCTCTCAGGGATCTTCTGCGCCTCTCTCAACTTCATCGA 
I CTCCACCAACACAGTCACTCCCACTGCCTCCTTCAAACCCCTGGGTCTGGCCAATGACACTG 
ACCACTACTTTCTGCGCTATGCTGTGCTGCCGCGGGAGGTGGTCTGCACCGAAAACCTCACC 
CCCTGGAAGAAGCTCTTGCCCTGTAGTTCCAAGGCAGGCCTCTCTGTGCTGCTGAAGGCAGA 
y TCGCTTGTTCCACACCAGCTACCACTCCCAGGCAGTGCATATCCGCCCTGTTTGCAGAAATG 
CACGCTGTACTAGCATCTCCTGGGAGCTGAGGCAGACCCTGTCAGTTGTATTTGATGCCTTC 
ATCACGGGGCAGGGAAAGAAAGACTGGTCCCTCTTCCGGATGTTCTCCCGAACCCTCACGGA 
GCCCTGCCCCCTGGCTTCAGAGAGCCGAGTCTATGTGGACATCACCACCTACAACCAGGACA 
ACGAGACATTAGAGGTGCACCCACCCCCGACCACTACATATCAGGACGTCATCCTAGGCACT 
CGGAAGACCTATGCCATCTATGACTTGCTTGACACCGCCATGATCAACAACTCTCGAAACCT 
CAACATCCAGCTCAAGTGGAAGAGACCCCCAGAGAATGAGGCCCCCCCAGTGCCCTTCCTGC 
ATGCCCAGCGGTACGTGAGTGGCTATGGGCTGCAGAAGGGGGAGCTGAGCACACTGCTGTAC 
H AACACCCACCCATACCGGGCCTTCCCGGTGCTGCTGCTGGACACCGTACCCTGGTATCTGCG 
^ GCTGTATGTGCACACCCTCACCATCACCTCCAAGGGCAAGGAGAACAAACCAAGTTACATCC 
ACTACCAGCCTGCCCAGGACCGGCTGCAACGCCACCTCCTGGAGATGCTGATTCAGCTGCCG 
GCCAACTCAGTCACCAAGGTTTCCATCCAGTTTGAGCGGGCGCTGCTGAAGTGGACCGAGTA 
CACGCCAGATCCTAACCATGGCTTCTATGTCAGCCCATCTGTCCTCAGCGCCCTTGTGCCCA 
GCATGGTAGCAGCCAAGCCAGTGGACTGGGAAGAGAGTCCCCTCTTCAACAGCCTGTTCCCA 
GTCTCTGATGGCTCTAACTACTTTGTGCGGCTCTACACGGAGCCGCTGCTGGTGAACCTGCC 
GACACCGGACTTCAGCATGCCCTACAACGTGATCTGCCTCACGTGCACTGTGGTGGCCGTGT 
GCTACGGCTCCTTCTACAATCTCCTCACCCGAACCTTCCACATCGAGGAGCCCCGCACAGGT 
GGCCTGGCCAAGCGGCTGGCCAACCTTATCCGGCGCGCCCGAGGTGTCCCCCCACTCTGATT 
CTTGCCCTTTCCAGCAGCTGCAGCTGCCGTTTCTCTCTGGGGAGGGGAGCCCAAGGGCTGTT 
TCTGCCACTTGCTCTCCTCAGAGTTGGCTTTTGAACCAAAGTGCCCTGGACCAGGTCAGGGC 
CTACAGCTGTGTTGTCCAGTACAGGAGCCACGAGCCAAATGTGGCATTTGAATTTGAATTAA 
CTTAGAAATTCATTTCCTCACCTGTAGTGGCCACCTCTATATTGAGGTGCTCAATAAGCAAA 
AGTGGTCGGTGGCTGCTGTATTGGACAGCACAGAAAAAGATTTCCATCACCACAGAAAGGTC 
GGCTGGCAGCACTGGCCAAGGTGATGGGGTGTGCTACACAGTGTATGTCACTGTGTAGTGGA 
TGGAGTTTACTGTTTG^GGAATAAAAACGGCTGTTTCCGTGGAAAAAAAAAAAA 
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MPIiALLVLLLLGPGGWCLAEPPRDSLREELVITPLPSGDVAATFQFRTRWDSELQREGVSHY 
RLFPKALGQLISKYSLRELHLSFTQGFWRTRYWGPPFLQAPSGAELVWWFQDTVTDVDKSWK 
ELSNVLSGIFCASLNFIDSTNTVTPTASirivPLGLANDTDHYFLRYAVLPREVVCTENLTPWK 
KLLPCSSKAGLSVLLKADRLFHTSYHSQAVHIRPVCRNARCTSISWELRQTLSWFDAFITG 
QGKKDWSLFRMFSRTLTEPCPLASESRVYVDITTYNQDNETLEVHPPPTTTYQDVILGTRKT 
YAIYDLLDTAMINNSRNLNIQLKWKRPPENEAPPVPFLRAQRYVSGYGLQKGELSTLLYNTH 
PYRAFPVLLLDTVPWYLRLYVHTLTITSKGKENKPSYIHYQPAQDRLQPHLLEMLIQLPANS 
VTKVSIQFERALLKWTEYTPDPNHGFYVS^SVLSALVPSMVAAKPVDWEESPLFNSLFPVSD 
GSNYFVRLYTEPLLVNLPTPDFSMPYNVICLTCTWAVCYGSFYNLLTRTFHIEEPRTGGLA 
KRLtANL I RRARGVP PL 



BNSDOCtD: <WO. .9963083A2 ! > 



WO 99/63088 / PCT/US99/122S2 

FIGURE 239 

CAACATGGGGT CC AGCAGCTTCTTGGTCCTC ATGGTGT CT CT CGTT CTTGTG AC CCTGGTGG 
CTGTGGAAGGAGTTAAAGAGGGTATAGAGAAAGCAGGGGTTTGCCCAGCTGACAACGTACGC 
, v v TGCTTCAAGTCCGATCCTCCCCAGTGTCACACAGACCAGGACTGTCTGGGGGAAAGGAAGTG 
TTGTTACCTGCACTGTGGCTTCAAGTGTGTGATTCCTGTGAAGGAACTGGAAGAAGGAGGAA 

# ACAAGGATGAAGATGTGTCAAGGCCATACCCTGAGCCAGGATGGGAGGCCAAGTGTCCAGGC 
TCCTCCTCTACCAGGTGTCCTCAGAAATGATGCTGGGTCCTTTCTACCTCTGGGGGTCACTC 
TCACTTGGCACCTGCCCCTGAGGGTCCTGAGACTTGGAATATGGAAGAAGCAATACCCAACC 
CCACCAA^GAAAACCTGAGCTTGAAGTCCTTTTCCCCAAAAAGAGGGAAGAGTCACAAAAAG 

* TCCAGACCCCAGGGACGGTACTTTCCCTCTCTACCTGGTGCTCCTCCCTAATGCTCATGAAT 
GGACCCCTCATGAATGAAACCAGTGCCCTTATAAGAGACCCCAAAGAGCTGCCTTGCCCTTC 
TGCAATGTGTGATCACAGCTAGAAGGCACTGTCAGAGAAGAGAAACTGGTCCTCACCAGATG 
CTGAAT CT G C TGGTGC CT TG AT CT T GGACTT C C C AG C CT CT AGAACTGT AAG AAAT AAAT AT 

|j TTGCTGTTTATAATCCAA 
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MGS S S FL VLIWS LVLVTLVAVEGVKEG I EKAGVC P ADNVRC F KS DP PQ CHTDQD CLGERKCC 
YLHCGFKCVIPVKELEEGGNKDEDVSRPYPEPGWEAKCPGSSSTRCPQK 
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AAACTCAGCACTTGCCGGAGTGGCTCATTGTTAAGACAAAGGGTGTGCACTTCCTGGCCAGG 
AAACCTGAGCGGTGAGACTCCCAGCTGCCTACATCAAGGCCCCAGGACATGCAGAACCTTCC 
TCTAGAACCCGACCCACCACCATGAGGTCCTGCCTGTGGAGATGCAGGCACCTGAGCCAAGG 
CGTCCAGTGGTCCTTGCTTCTGGCTGTCCTGGTCTTCTTTCTCTTCGCCTTGCCCTCTTTTA 
TTAAGGAGC CTCAAACAAAG C CTT CCAGGC AT CAACG C AC AG AGAA CATT AAAGAAAGGTCT 
CTACAGTCCCTGGCAAAGCCTAAGTCCCAGGCACCCACAAGGGCGAGGAGGACAACCATCTA 
TGCAGAGCCAGCGCCAGAGAACAATGCCCTCAACACACAAACCCAGCCCAAGGCCCACACCA 
CCGGAGACAGAGGAAAGGAGGCCAACCAGGCACCGCCGGAGGAGCAGGACAAGGTGCCCCAC 
ACAGCACAGAGGGCAGCATGGAAGAGCCCAGAAAAAGAGAAAACCATGGTGAACACACTGTC 
ACCCAGAGGGCAAGATGCAGGGATGGCCTCTGGCAGGACAGAGGCACAATCATGGAAGAGCC 
AGGACACAAAGACGACCCAAGGAAATGGGGGCCAGACCAGGAAGCTGACGGCCTCCAGGACG 
GTGTCAGAGAAGCACCAGGGCAAAGCGGCAACCACAGCCAAGACGCTCATTCCCAAAAGTCA 
GC AC AG AATGCTGGCTCCCACAGG AG CAGTGTCAACAAGGACGAGACAGAAAGGAGTGACC A 
CAGCAGTCATCCCACCTAAGGAGAAGAAACCTCAGGCCACCCCACCCCCTGCCCCTTTCCAG 
AGCCCCACGACGCAGAGAAACCAAAGACTGAAGGCCGCCAACTTCAAATCTGAGCCTCGGTG 
GGATTTTGAGGAAAAATACAGCTTCGAAATAGGAGGCCTTCAGACGACTTGCCCTGACTCTG 
TGAAGATCAAAGCCTCCAAGTCGCTGTGGCTCCAGAAACTCTTTCTGCCCAACCTCACTCTC 
TTCCTGGACTCCAGACACTTCAACCAGAGTGAGTGGGACCGCCTGGAACACTTTGCACCACC 
CTTTGGCTTCATGGAGCTCAACTACTCCTTGGTGCAGAAGGTCGTGACACGCTTCCCTCCAG 
TGCCCCAGCAGCAGCTGCTCCTGGCCAGCCTCCCCGCTGGGAGCCTCCGGTGCATCACCTGT 
GCCGTGGTGGGCAACGGGGG CATC CTGAACAACTC CCA CATGGGCCAGGAGATAGACAGTCA 
CGACTACGTGTTCCGATTGAGCGGAGCTCTCATTAAAGGCTACGAACAGGATGTGGGGACTC 
GGACATCCTTCTACGGCTTTACCGCCTTCTCCCTGACCCAGTCACTCCTTATATTGGGCAAT 
CGGGGTTTCAAGAACGTGCCTCTTGGGAAGGACGTCCGCTACTTGCACTTCCTGGAAGGCAC 
CCGGGACTATGAGTGGCTGGAAGCACTGCTTATGAATCAGACGGTGATGTCAAAAAACCTTT 
TCTGGTTCAGGCACAGACCCCAGGAAGCTTTTCGGGAAGCCCTGCACATGGACAGGTACCTG 
TTGCTGCACCCAGACTTTCTCCGATACATGAAGAACAGGTTTCTGAGGTCTAAGACCCTGGA 
TGGTGCCCACTGGAGGATATACCGCCCCACCACTGGGGCCCTCCTGCTGCTCACTGCCCTTC 
AGCTCTGTGACCAGGTGAGTGCTTATGGCTTCATCACTGAGGGCCATGAGCGCTTTTCTGAT 
CACTACTATGATACATCATGGAAGCGGCTGATCTTTTACATAAACCATGACTTCAAGCTGGA 
GAGAGAAGTCTGGAAGCGGCTACACGATGAAGGGATAATCCGGCTGTACCAGCGTCCTGGTC 
CCGGAACTGCCAAAGCCAAGAACTGACCGGGGCCAGGGCTGCCATGGTCTCCTTGCCTGCTC 
CAAGGCACAGGATACAGTGGGAATCTTGAGACTCTTTGGCCATTTCCCATGGCTCAGACTAA 
GCTCCAAGCCCTTCAGGAGTTCCAAGGGAACACTTGAACCATGGACAAGACTCTCTCAAGAT 
GGCAAATGGCTAATTGAGGTTCTGAAGTTCTTCAGTACATTGCTGTAGGTCCTGAGGCCAGG 
GATTTTTAATTAAATGGGGTGATGGGTGGCCAATACCACAATTCCTGCTGAAAAACACTCTT 
CCAGTCCAAAAGCTTCTTGATACAGAAAAAAGAGCCTGGATTTACAGAAACATATAGATCTG 
GTTTGAATTCCAGATCGAGTTTACAGTTGTGAAATCTTGAAGGTATTACTTAACTTCACTAC 
AGATTGTCTAGAAGACCTTTCTAGGAGTTATCTGATTCTAGAAGGGTCTATACTTGTCCTTG 
TCTTTAAGCTATTTGACAACTCTACGTGTTGTAGAAAACTGATAATAATACAAATGATTGTT 
GTCCATGGAAAGGCAAATAAATTTTCTACAGTGAAAAAAAAAAAAAAA 
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MR S CL WR CRHL S QGVQ WS LLLAVL VF FL F AL P S F I KE P Q T K P S RHQRT EN I KERSLQ S LAKP 
KSQAPTRARRTTIYAEPAPE3SnSTALNTQTQPKAHTTGDRGKEANQAPPEEQDKVPHTAQRAAW 
KSPEKEKTMVNTLSPRGQDAGMASGRTEAQSWKSQDTKTTQGNGGQTRKLTASRTVSEICHQG 
KAATTAKTLI PKSQHRMLAPTGAVSTRTRQKGVTTAVI PPKEKKPQATPPPAPFQSPTTQRN 
QRLKAANFKSEPRWDFEEKYSFEIGGLQTTCPDSVKIKASKSLWLQKLFLPNLTLFLDSRHF 
NQSEWDRLEHFAPPFGFMELNYSLVQKWTRFPPVPQQQLLLASLPAGSLRCITCAWGNGG 
I LNNSHMGQE IDSHDYVFRLSGAL I KGYEQDVGTRTSF YGFTAFSLTQSLL I LGNRGFKNVP 
LGKDVRYLHFLEGTRDYEWLEALLMNQTVMSKNLFWFRHRPQEAFREALHMDRYLLLHPDFL 
RYMKNRFLRSKTLDGAHWRIYRPTTGALLLLTALQLCDQVSAYGFITEGHERFSDHYYDTSW 
KRLI FYINHDFKLEREVWKRLHDEGI IRLYQRPGPGTAKAKN 
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CGATGCGCGGACCCGGGCACCCCCTCCTCCTGGGGCTGCTGCTGGTGCTGGGGCCTTCGCCG 
GAGCAGCGAGTGGAAATTGTTCCTCGAGATCTGAGGATGAAGGACAAGTTTCTAAAACACCT 
TACAGGCCCTCTTTATTTTAGTCCAAAGTGCAGCAAACACTTCCATAGACTTTATCACAACA 
CCAGAGACTGCACCATTCCTGCATACTATAAAAGATGCGCCAGGCTTCTTACCCGGCTGGCT 
GTCAGTCCAGTGTGCATGGAGGATAAGTCAGCAGACCGT^.CAGGAGCAGCACACCAGGAGCC 
ATGAGAAGTGCCTTGGAAACCAACAGGGAAACAGAACTAT CTTTATACACATCCCCTCATGG 
ACAAGAGATTTATTTTTGCAGACAGACTCTTCCATAAGTCCTTTGAGTTTTGTATGTTGTTG 
ACAGTTTGCAGATATATATTCGATAAATCAJTGTACTTGACAGTGTTATCTGTCACTTATTT 
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MRGPGHPLLLGLLLVLGPSPEQRVEIVPRDLiRMKDKFLKHLTGPLYFSPKCSKHFHRLYHNT 
RDCT I PAYYKRCARLLTRLAVS PVCMEDK 
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FIGURE 245 

GGGCTGGGCCCCGCCGCAGCTCCAGCTGGCCGGCTTGGTCCTGCGGTCCCTTCTCTGGGAGG 
CCCGACCCCGGCCGCGCCCAGCCCCCACCATGCCACCCGCGGGGCTCCGCCGGGCCGCGCCG 
CTCACCGCAATCGCTCTGTTGGTGCTGGGGGCTCCCCTGGTGCTGGCCGGCGAGGACTGCCT 
GTGGTACCTGGACCGGAATGGCTCCTGGCATCCGGGGTTTAACTGCGAGTTCTTCACCTTCT 

§j GCTGCGGGACCTGCTACCATCGGTACTGCTGCAGGGACCTGACCTTGCTTATCACCGAGAGG 

I 

■ CAGCAGAAGCACTGCCTGGCCTTCAGCCCCAAGACCATAGCAGGCATCGCCTCAGCTGTGAT 
CCTCTTTGTTGCTGTGGTTGCCACCACCATCTGCTGCTTCCTCTGTTCCTGTTGCTACCTGT 
ACCGCCGGCGCCAGCAGCTCCAGAGCCCATTTGAAGGCCAGGAGATTCCAATGACAGGCATC 
CCAGTGCAGCCAGTATACCCATACCCCCAGGACCCCAAAGCTGGCCCTGCACCCCCACAGCC 
TGGCTTCATGTACCCACCTAGTGGTCCTGCTCCCCAATATCCACTCTACCCAGCTGGGCCCC 
CAGTCTACAACCCTGCAGCTCCTCCTCCCTATATGCCACCACAGCCCTCTTACCCGGGAGCC 
TGAGGAACCAGCCATGTCTCTGCTGCCCCTTCAGTGATGCCAACCTTGGGAGATGCCCTCAT 
B| CCTGTACCTGCATCTGGTCCTGGGGGTGGCAGGAGTCCTCCAGCCACCAGGCCCCAGACCAA 
^ GCCAAGCCCTGGGCCCTACTGGGGACAGAGCCCCAGGGAAGTGGAACAGGAGCTGAACTAGA 
ACTATGAGGGGTTGGGGGGAGGGCTTGGAATTATGGGCTATTTTTACTGGGGGCAAGGGAGG 
GAGATGACAGCCTGGGTCACAGTGCCTGTTTTCAAATAGTCCCTCTGCTCCCAAGATCCCAG 
CCAGGAAGGCTGGGGCCCTACTGTTTGTCCCCTCTGGGCTGGGGTGGGGGGAGGGAGGAGGT 
TCCGTCAGCAGCTGGCAGTAGCCCTCCTCTCTGGCTGCCCCACTGGCCACATCTCTGGCCTG 
CTAGATTAAAGCTGTAAAGACAAAA 
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MPPAGLRRAAPLTAIALLVLGAPLVLAGEDCLWYLDRNGSWHPGFNCEFFTFCCGTCYHRYC 
CRDLTLLITERQQKHCLAFSPKTIAGIASAVILFVAWATTICCFLCSCCYLYRRRQQLQSP 
FEGQE I PMTGI PVQPVYP YPQD PKAG PAP PQPGFMYP P SG P APQ YPLYPAGP PVYNPAAPPP 
YMPPQPSYPGA 
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GGGGGAGCTAGGCCGGCGGCAGTGGTGGTGGCGGCGGCGCAAGGGTGAGGGCGGCCCCAGAA 
CCCCAGGTAGGTAGAGCAAGAAGATGGTGTTTCTGCCCCTCAAATGGTCCCTTGCAACCATG 
TCATTTCTACTTTCCTCACTGTTGGCTCTCTTAACTGTGTCCACTCCTTCATGGTGTCAGAG 
CACTGAAGCATCTCCAAAACGTAGTGATGGGACACCATTTCCTTGGAATAAAATACGACTTC 
^ CTGAGTACGTCATCCCAGTTCATTATGATCTCTTGATCCATGCAAACCTTACCACGCTGACC 
^ TTCTGGGGAACCACGAAAGTAGAAATCACAGCCAGTCAGCCCACCAGCACCATCATCCTGCA 
TAGT C AC C AC CT G C AG AT AT CT AGGG C C AC C CTC AGGAAGGGAGCTGG AG AG AGG CT AT CGG 
,v.; AAGAACCCCTGCAGGTCCTGGAACACCCCCCTCAGGAGCAAATTGCACTGCTGGCTCCCGAG 
f CCCCTCCTTGTCGGGCTCCCGTACACAGTTGTCATTCACTATGCTGGCAATCTTTCGGAGAC 
:| TTTCCACGGATTTTACAAAAGCACCTACAGAACCAAGGAAGGGGAACTGAGGATACTAGCAT 
CAACACAATTTGAACCCACTGCAGCTAGAATGGCCTTTCCCTGCTTTGATGAACCTGCCTTC 
AAAG CAAGTTTCTCAATCAAAATTAGAAGAGAG CC AAGG C ACC TAGC CAT CTCCAATATGC C 
ATTGGTGAAATCTGTGACTGTTGCTGAAGGACTCATAGAAGACCATTTTGATGTCACTGTGA 
^ AGATGAGCACCTATCTGGTGGCCTTCATCATTTCAGATTTTGAGTCTGTCAGCAAGATAACC 
AAGAGTGGAGTCAAGGTTTCTGTTTATGCTGTGCCAGACAAGATAAATCAAGCAGATTATGC 
ACTGGATGCTGCGGTGACTCTTCTAGAATTTTATGAGGATTATTTCAGCATACCGTATCCCC 
TACCCAAACAAGATCTTGCTGCTATTCCCGACTTTCAGTCTGGTGCTATGGAAAACTGGGGA 
CTGACAACATATAGAGAATCTGCTCTGTTGTTTGATGCAGAAAAGTCTTCTGCATCAAGTAA 
GCTTGGCATCACAGTGACTGTGGCCCATGAACTGGCCCACCAGTGGTTTGGGAACCTGGTCA 
CTATGGAATGGTGGAATGATCTTTGGCTAAATGAAGGATTTGCCAAATTTATGGAGTTTGTG 
TCTGTCAGTGTGACCCATCCTGAACTGAAAGTTGGAGATTATTTCTTTGGCAAATGTTTTGA 
CGCAATGGAGGTAGATGCTTTAAATTCCTCACACCCTGTGTCTACACCTGTGGAAAATCCTG 
CTCAGATCCGGGAGATGTTTGATGATGTTTCTTATGATAAGGGAGCTTGTATTCTGAATATG 
^ CTAAGGGAGTATCTTAGCGCTGACGCATTTAAAAGTGGTATTGTACAGTATCTCCAGAAGCA 
TAGCTATAAAAATACAAAAAACGAGGACCTGTGGGATAGTATGGCAAGTATTTGCCCTACAG 
ATGGTGTAAAAGGGATGGATGGCTTTTGCTCTAGAAGTCAACATTCATCTTCATCCTCACAT 
TGGCATCAGGAAGGGGTGGATGTGAAAACCATGATGAACACTTGGACACTGCAGAGGGGTTT 
TCCCCTAATAACCATCACAGTGAGGGGGAGGAATGTACACATGAAGCAAGAGCACTACATGA 
AGGGCTCTGACGGCGCCCCGGACACTGGGTACCTGTGGCATGTTCCATTGACATTCATCACC 
AGCAAATCCAACATGGTCCATCGATTTTTGCTAAAAACAAAAACAGATGTGCTCATCCTCCC 
AGAAGAGGTGGAATGGATCAAATTTAATGTGGGCATGAATGGCTATTACATTGTGCATTACG 
AGG ATG ATGGATGGG ACT CTT TGACTGGC CTTT TAAAAG G AAC AC AC AC AG CAGT CAGCAGT 
AATGATCGGGCAAGTCTCATTAACAATGCATTTCAGCTCGTCAGCATTGGGAAGCTGTCCAT 
TGAAAAGGCCTTGGATTTATCCCTGTACTTGAAACATGAAACTGAAATTATGCCCGTGTTTC 
AAGGTTTGAATGAGCTGATTCCTATGTATAAGTTAATGGAGAAAAGAGATATGAATGAAGTG 
GAAACTCAATTCAAGGCCTTCCTCATCAGGCTGCTAAGGGACCTCATTGATAAGCAGACATG 
GACAGACGAGGGCTCAGTCTCAGAGCAAATGCTGCGGAGTGAACTACTACTCCTCGCCTGTG 
TGCACAACTATCAGCCGTGCGTACAGAGGGCAGAAGGCTATTTCAGAAAGTGGAAGGAATCC 
AATGGAAACTTGAGCCTGCCTGTCGACGTGACCTTGGCAGTGTTTGCTGTGGGGGCCCAGAG 
CACAGAAGGCTGGGATTTTCTTTATAGTAAATATCAGTTTTCTTTGTCCAGTACTGAGAAAA 
GCCAAATTGAATTTGCCCTCTGCAGAACCCAAAATAAGGAAAAGCTTCAATGGCTACTAGAT 
GAAAGCTTTAAGGGAGATAAAATAAAAACTCAGGAGTTTCCACAAATTCTTACACTCATTGG 
^ CAGGAACCCAGTAGGATACCCACTGGCCTGGCAATTTCTGAGGAAAAACTGGAACAAACTTG 
TACAAAAGTTTGAACTTGGCTCATCTTCCATAGCCCACATGGTAATGGGTACAACAAATCAA 
TTCTCCACAAGAACACGGCTTGAAGAGGTAAAAGGATTCTTCAGCTCTTTGAAAGAAAATGG 
TTCTCAGCTCCGTTGTGTCCAACAGACAATTGAAACCATTGAAGAAAACATCGGTTGGATGG 
1 ATAAGAATTTTGATAAAATCAGAGTGTGGCTGCAAAGTGAAAAGCTTGAACGTATG TAA AAA 
TTCCTCCCTTGCCCGGTTCCTGTTATCTCTAATCACCAACATTTTGTTGAGTGTATTTTCAA 
ACTAGAGATGGCTGTTTTGGCTCCAACTGGAGATACTTTTTTCCCTTCAACTCATTTTTTGA 
CTATCCCTGTGAAAAGAATAGCTGTTAGTTTTTCATGAATGGGCTTTTTCATGAATGGGCTA 
TCGCTACCATGTGTTTTGTTCATCACAGGTGTTGCCCTGCAACGTAAACCCAAGTGTTGGGT 
TCCCTGCCACAGAAGAATAAAGTACCTTATTCTTCTCAAAAAAAAAAAAAAAAAAAAAAAAA^ 
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MVFLPLKWSLATMSFLLSSLLALLTVSTPSWCQSTEASPKRSDGTPFPWNKIRLPEYVIPVH 
YDLL IHANLTTLTFWGTTKVE I TASQPTSTI I LHSHHLQ I SRATLRKGAGERLS EE PLQVLE 
HPPQEQIALLAPEPLLVGLPYTWIHYAGNLSETFHGFYKSTYRTKEGELRIIiASTQFEPTA 
ARMAFPCFDEPAFKASFSIKIRREPRHLAISNMPLVKSVTVAEGLIEDHFDVTVKMSTYLVA 
FIISDFESVSKITKSGVKVSVYAVPDKINQADYALDAAVTLLEFYEDYFSIPYPLPKQDLAA 
IPDFQSGAME^GLTTYRESALLFDAEKSSASSKLGITVTVAHELAHQWFGNLVTMEWWNDL 
WLNEGFAKFMEFVSVSVTHPELKVGDYFFGKCFDAMEVD7VLNSSHPVSTPVENPAQIREMFD 
DVSYDKGACILNMLREYLSADAFKSGIVQYLQKHSYKNTKNEDLWDSMASICPTDGVKGMDG 
FCSRSQHSSSSSHWHQEGVDVKTMMNTWTLQRGFPLITITVRGR3WHMKQEHYMKGSDGAPD 
TG YL WH VP LiT FITS KSNMVHRF LL KT KTD VL I L P E E VE W I K F NV GMNG Y Y I VH Y EDDGWD S L 
TGLLKGTHTAVSSNDRASLINNAFQLVS IGKLS I EKALDLSL YLKHETEIMPVFQGLNELI P 
MYKLMEKRDMNEVETQFKAFLIRLLRDLIDKQTWTDEGSVSEQMLRSELLLLACVHNYQPCV 
QRAEGYFRKWKESNGNLSLPVDVTLAVFAVGAQSTE GWDF L Y S K YQFS LS STEKSQ I E FALC 
RTQNKEKLQWLLDESFKGDKIKTQEFPQILTLIGRNPVGYPLAWQFLRKNWNKLVQKFELGS 
SSIAHMVMGTTNQFSTRTRLEEVKGFFSSLKENGSQLRCVQQTIETIEENIGWMDKNFDKIR 
VWLQSEKLERM 
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CAGCCACAGACGGGTCATGAGCGCGGTATTACTGCTGGCCCTCCTGGGGTTCATCCTCCCAC 
TGCCAGGAGTGCAGGCGCTGCTCTGCCAGTTTGGGACAGTTCAGCATGTGTGGAAGGTGTCC 
GACCTACCCCGGCAATGGACCCCTAAGAACACCAGCTGCGACAGCGGCTTGGGGTGCCAGGA 

: - CACGTTGATGCTCATTGAGAGCGGACCCCAAGTGAGCCTGGTGCTCTCCAAGGGCTGCACGG 
AGGCCAAGGACCAGGAGCCCCGCGTCACTGAGCACCGGATGGGCCCCGGCCTCTCCCTGATC 

| TCCTACACCTTCGTGTGCCGCCAGGAGGACTTCTGCAACAACCTCGTTAACTCCCTCCCGCT 
TTGGGCCCCACAGCCCCCAGCAGACCCAGGATCCTTGAGGTGCCCAGTCTGCTTGTCTATGG 
AAGGCTGTCTGGAGGGGACAACAGAAGAGATCTGCCCCAAGGGGACCACACACTGTTATGAT 
GGCCTCCTCAGGCTCAGGGGAGGAGGCATCTTC JCCAATCTGAGAGTCCAGGGATGCATGCC 

^ CCAGCCAGGTTGCAACCTGCTCAATGGGACACAGGAAATTGGGCCCGTGGGTATGACTGAGA 
ACTGCAATAGGAAAGATTTTCTGACCTGTCATCGGGGGACCACCATTATGACACACGGAAAC 
TTGGCTCAAGAACCCACTGATTGGACCACATCGAATACCGAGATGTGCGAGGTGGGGCAGGT 
GTGTCAGGAGACGCTGCTGCTCATAGATGTAGGACTCACATCAACCCTGGTGGGGACAAAAG 
GCTGCAGCACTGTTGGGGCTCAAAATTCCCAGAAGACCACCATCCACTCAGCCCCTCCTGGG 
GTGCTTGTGGCCTCCTATACCCACTTCTGCTCCTCGGACCTGTGCAATAGTGCCAGCAGCAG 
CAGCGTTCTGCTGAACTCCCTCCCTCCTCAAGCTGCCCCTGTCCCAGGAGACCGGCAGTGTC 
CTACCTGTGTGCAGCCCCTTGGAACCTGTTCAAGTGGCTCCCCCCGAATGACCTGCCCCAGG 
GGCGCCACTCATTGTTATGATGGGTACATTCATCTCTCAGGAGGTGGGCTGTCCACCAAAAT 
GAGCATTCAGGGCTGCGTGGCCCAACCTTCCAGCTTCTTGTTGAACCACACCAGACAAATCG 
GGATCTTCTCTGCGCGTGAGAAGCGTGATGTGCAGCCTCCTGCCTCTCAGCATGAGGGAGGT 
GGGGCTGAGGGCCTGGAGTGTCTCACTTGGGGGGTGGGGCTGGCACTGGCCCCAGCGCTGTG 
GTGGGGAGTGGTTTGCCCTTCCTGCTAACTCTATTACCCCCACGATTCTTCACCGCTGCTGA 
CCACCCACACTCAACCTCCCTCTGACCTCATAACCTAATGGCCTTGGACACCAGATTCTTTC 
CCATTCTGTCCATGAATCATCTTCCCCACACACAATCATTCATATCTACTCACCTAACAGCA 
ACACTGGGGAGAGCCTGGAGCATCCGGACTTGCCCTATGGGAGAGGGGACGCTGGAGGAGTG 

*S GCTGCATGTATCTGATAATACAGACCCTGTCCTTTCA 
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MSAVLLI^LLGFILPLPGVQALLCQFGTVQHVWKVSDLPRQWTPKNTSCDSGLGCQDTLMLI 
ESGPQVSLVLSKGCTEAKDQEPRVTEHRMGPGLSLISYTFVCRQEDFCNNLVNSLPLWAPQP 
PADPGSLRCPVCLSMEGCLEGTTEEICPKGTTHCYDGLLRLRGGGIFSNLRVQGCMPQPGCN 
LLNGTQEIGPVGMTENCNRKDFLTCHRGTTIMTHGNLAQEPTDWTTSNTEMCEVGQVCQETL 
LLIDVGLTSTLVGTKGCSTVGAQNSQKTTIHSAPPGVLVASYTHFCSSDLCNSASSSSVLLN 
SLPPQAAPVPGDRQCPTCVQPLGTCSSGSPRMTCPRGATHCYDGYIHLSGGGLSTKMSIQGC 
VAQPSS FLLNHTRQI GI FSAREKRDVQPPASQHEGGGAEGLE S LTWGVGLALAPALWWGVVCPS C 
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FIGURE 251 

GCGACGGGCAGGACGCCCCGTTCGCCTAGCGCGTGCTCAGGAGTTGGTGTCCTGCCTGCGCT 
CAGGATGAGGGGGAATCTGGCCCTGGTGGGCGTTCTAATCAGCCTGGCCTTCCTGTCACTGCTG 
CCATCTGGACATCCTCAGCCGGCTGGCGATGACGCCTGCTCTGTGCAGATCCTCGTCCCTGG 
CCTCAAAGGGGATGCGGGAGAGAAGGGAGACAAAGGCGCCCCCGGACGGCCTGGAAGAGTCG 
3 GCCCCACGGGAGAAAAAGGAGACATGGGGGACAAAGGACAGAAAGGCAGTGTGGGTCGTCAT 
^ GGAAAAATTGGTCCCATTGGCTCTAAAGGTGAGAAAGGAGATTCCGGTGACATAGGACCCCC 
TGGTCCTAATGGAGAACCAGGCCTCCCATGTGAGTGCAGCCAGCTGCGCAAGGCCATCGGGG 
V ' AGATGGACAACCAGGTCTCTCAGCTGACCAGCGAGCTCAAGTTCATCAAGAATGCTGTCGCC 
GGTGTGCGCGAGACGGAGAGCAAGATCTACCTGCTGGTGAAGGAGGAGAAGCGCTACGCGGA 
CGCCCAGCTGTCCTGCCAGGGCCGCGGGGGCACGCTGAGCATGCCCAAGGACGAGGCTGCCA 
ATGGCCTGATGGCCGCATACCTGGCGCAAGCCGGCCTGGCCCGTGTCTTCATCGGCATCAAC 
GACCTGGAGAAGGAGGGCGCCTTCGTGTACTCTGACCACTCCCCCATGCGGACCTTCAACAA 
■4 GTGGCGCAGCGGTGAGCCCAACAATGCCTACGACGAGGAGGACTGCGTGGAGATGGTGGCCT 
I CGGGCGGCTGGAACGACGTGGCCTGCCACACCACCATGTACTTCATGTGTGAGTTTGACAAG 
GAGAACATGTGAGCCTCAGGCTGGGGCTGCCCATTGGGGGCCCCACATGTCCCTGCAGGGTT 
GGCAGGGACAGAGCCCAGACCATGGTGCCAGCCAGGGAGCTGTCCCTCTGTGAAGGGTGGAG 
GCTCACTGAGTAGAGGGCTGTTGTCTAAACTGAGAAAATGGCCTATGCTTAAGAGGAAAATG 
AAAGTGTTCCTGGGGTGCTGTCTCTGAAGAAGCAGAGTTTCATTACCTGTATTGTAGCCCCA 
ATGTCATTATGTAATTATTACCCAGAATTGCTCTTCCATAAAGCTTGTGCCTTTGTCCAAGC 
TATACAATAAAATCTTTAAGTAGTGCAGTAGTTAAGTCCAAAAAAAAAAAAAAAAAAA 
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FIGURE 252 



MRGNLALVGVL I SLAFLSLLPSGHPQPAGDDACS VQ I LVPGLKGDAGEKGDKGAPGRPGRVG 
PTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPNGEPGLPCECSQLRKAIGE 
MDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKSEKRYADAQLSCQGRGGTLSMPKDEAAN 
GLMAAYLAQAGLARVF IG INDLEKEGAFVYSDHS PMRTFNKWRSGE PNNAYDEEDCVEMVAS 
GGWNDVACHTTMYFMCEFDKENM 



i 
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FIGURE 253 

AGTGACTGCAGCCTTCCTAGATCCCCTCCACTCGGTTTCTCTCTTTGCAGGAGCACCGGCAG 
CACCAGTGTGTGAGGGGAGCAGGCAGCGGTCGTAGCCAGTTCCTTGATCCTGCCAGACCACC 
CAGCCCCCGGCACAGAGCTGCTCCACAGGCACCATGAGGATCATGCTGCTATTCACAGCCAT 
CCTGGCCTTCAGCCTAGCTCAGAGCTTTGGGGCTGTCTGTAAGGAGCCACAGGAGGAGGTGG 
! : | TTCCTGGCGGGGGCCGCAGCAAGAGGGATCCAGATCTCTACCAGCTGCTCCAGAGACTCTTC 
^ AAAAGCCACTCATCTCTGGAGGGATTGCTCAAAGCCCTGAGCCAGGCTAGCACAGATCCTAA 
GGAATCAACATCTCCCGAGAAACGTGAGATGCATGACTTCTTTGTGGGACTTATGGGCAAGA 
GGAGCGTCCAGCCAGAGGGAAAGACAGGACCTTTCTTACCTTCAGTGAGGGTTCCTCGGCCC 
a CTTCATCCCAATCAGCTTGGATCCACAGGAAAGTCTTCCCTGGGAACAGAGGAGCAGAGACC 
TTTATAAGACTCTCCTACGGATGTGAATCAAGAGAACGTCCCCAGCTTTGGCATCCTCAAGT 
ATCCCCCGAGAGCAGAATAGGTACTCCACTTCCGGACTCCTGGACTGCATTAGGAAGACCTC 
TTTCCCTGTCCCAATCCCCAGGTGCGCACGCTCCTGTTACCCTTTCTCTTCCCTGTTCTTGT 
$ AACATTCTTGTGCTTTGACTCCTTCTCCATCTTTTCTACCTGACCCTGGTGTGGAAACTGCA 
^ TAGTGAATATCCCCAACCCCAATGGGCATTGACTGTAGAATACCCTAGAGTTCCTGTAGTGT 
CCTACATTAAAAATATAATGTCTCTCTCTATTCCTCAACAATAAAGGATTTTTGCATATGAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MRIMLLFTAILAFSLAQSFGAVCKEPQEEWPGGGRSKRDPDLYQLLQRLFKSHSSLEGLLK 
ALSQASTDPKESTSPEKRDMHDFFVGLMGKRSVQPEGKTGPFLPSVRVPRPLHPNQLGSTGK 
SSLGTEEQRPL 
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FIGURE 255 

GGGCGTCTCCGGCTGCTCCTATTGAGCTGTCTGCTCGCTGTGCCCGCTGTGCCTGCTGTGCC 
CGCGCTGTCGCCGCTGCTACCGCGTCTGCTGGACGCGGGAGACGCCAGCGAGCTGGTGATTG 
GAGCCCTGCGGAGAGCTCAAGCGCCCAGCTCTGCCCCAGGAGCCCAGGCTGCCCCGTGAGTC 
CCATAGTTGCTGCAGGAGTGGAGCCATGAGCTGCGTCCTGGGTGGTGTCATCCCCTTGGGGC 
TGCTGTTCCTGGTCTGCGGATCCCAAGGCTACCTCCTGCCCAACGTCACTCTCTTAGAGGAG 
CTGCTCAGCAAATACCAGCACAACGAGTCTCACTCCCGGGTCCGCAGAGCCATCCCCAGGGA 
GGACAAGGAGGAGATCCTCATGCTGCACAACAAGCTTCGGGGCCAGGTGCAGCCTCAGGCCT 
CCAACA '"3GAGTACATGGTGAGCGCCGGCTCCGGCCGCAGAGGCTGGCACCGGGGGTGGGGC 
CTGGGCCACCAGCCTGCTCTGTTCCCCAGCCAGCTCTGTTCCCCAGCCAGTGCGTGTGATGG 
CTGGCTCAGGGTCTCCTCTGGCAGGGGAGGATCCCGGCTCTGTTCTGTTTTGTTTGTTTGTT 
TTGAGACAGGGTCTCACTCTGCCACTGACGCTGGAGTGCAATGGCACAATCGTCATGCCCTG 
AAAC CTTAGACTC CCGGGGTTAAGCGATCCTGCTTC AG CCT C CC AAGT AG CTGGAACTAC AG 
GC ATGCAC CATGGTGC CCAGCTAGATTTTAAAT ATTTTGTGG AG AT GGGGGT CTTG CTACGT 
TGCCCAGGCTGGTCTTGAACTCCTAGGCTCAAGCAATCCTCCTGCCTCAGCCTCTCAAAGTG 
CTAGGATTATAGGCATGAGTCACCCTGTCTGGCTCTGGCTCTGTTCTTAACATTCTGCCAAA 
ACAACACACGTGGGTTCCCTGTGCAGAGCCTGCCTCGTTGCCTTCATGTCACTCTTGGTAGC 
TCCACTGGGAACACAGCTCTCAGCCTTTCCCACCTGGAGGCAGAGTGGGGAGGGGCCCAGGG 
CTGGGCTTTGCTGATGCTGATCTCAGCTGTGCCACACGCTAGCTGCACCACCCTGACTTCTC 
CTTAGCCCGTGTGAGCCTCACTTTCCACTTGGAGAGTCCTTCCTCGCGTGGTTGCCATGACT 
GTGAGATAAGTCGAGGCTGTGAAGGGCCCGGCACAGACTGACCTGCCTCCCCAACCCCTAGG 
CTTTGCTAACCGGGAAAGGAGCTAACGGTGACAGAAGACAGCCAAGGTCAACCCTCCCGGGT 
GATTGTGATGGGTGTTCCAGGTGTGGTTGGGCGATGCTGCTACTTGACCCCAAGCTCCAGTG 
TGGAAACTTCCTTCCTGGCTGGTTTTCCAGAACTACAGAGGAATGGACCACAGTCTTCCAGG 
GTCCCTCCTCGTCCACCAACCGGGAGCCTCCACCTTGGCCATCCGTCAGCTATGAATGGCTT 
TTTAAACAAACCCACGTCCCAGCCTGGGTAACATGGTAAAGCCCCGTCTCTACAAAAAAATC 
CAAGTTAGCCGGGCATGGTGGTGCGCACCTGTAGTCCCAGCTGCAGTGGGACTGAGGTGGAG 
GTGGAGGTGGGGGGTGGGAGCTGAGGAAGGAGGATCGCTTGAGCCTGGGAAGTCGAGGCTGC 
AGTGAGCTGAGATTGCACCACTGCACTCCAGCCTGGGTGACAGAGCAAGACCCTGTCTCAAAAA 
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MSCVLGGVIPLGLLFLVCGSQGYLLPNVTLLEELLSKYQHNESHSRVRRAIPREDKEEILML 
HNKLRGQVQPQASNMEYMVSAGSGRRGWHRGVJGLGHQPALFPSQLCSPASACDGWLRVSSGR 
GGSRLCSVLFVCFETGSHSATDAGVQWHNRHALKP 
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FIGURE 257 

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACT ATG GG 
GTCTGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGG 
GTATGACTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGC 
TTCCTGGAATTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCAC 
CCTCCACCATGCAAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTG 
TGTCCTTCTTGGCCCGGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTT 
CAGCAGGCCCCCACCCTCCTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 258 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTS 
VTLHHARSQHHWCNT 
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AATTGTATCTGTGTAATGTTAAAACAAACGAAATAAAATAGAAGGAAAAACTTTCTGAGTTT 
CAAAAACAACAGACTAGTACTCTAAAGAACTCTTTAAAACAATTAACTGTTAGGATTGCAGT 

^ TATGATTGGATATTATTTAATTCTGTTTCrGATGTGGGGTTCCTCCACTGTGTTCTGTGTGC 
TATTAATATTTACCATTGCAGAAGCTTCATTCAGTGTTGAAAATGAATGCTTAGTGGATCTG 

•I TGCCTCTTACGCATATGTTACAAATTATCTGGAGTTCCT?VATCAATGCAGAGTTCCCCTCCC 
CTCCGATTGTTCTAAATAATTGAAAGATGTCTGCTGTGG.AAAAAGGCATGTATTTAAATCTG 
TATGATTCTCAACCATCTTTAGTTGGGAAAGGTCCTTGAAAGCCAATGGAAATACTTTTTTT 
TTTTCTTGGCACTAATCAAGTGAGTGTTAwCTTTTCACTTAGTAGGATGTGTTGTTACGCTA 
GTAAAATAGAAACCTGTGTTTATTCTCAGGTATTTTAGAAACAACAGCCATCATTTTATTTT 
ATGTGTGTGTTCTTGGCTGTATTCATAAATTATATATTTTGGGCTATCAAATATTACTTCAT 
TCAATATAAATAACAATAGTAGAAGTTGTTTACTTAGATATGCTTTCTAGTTGCATTTTCTC 
AGCCTATGTAAGACTACTTTGTTGTAATAGCCTTTGAAATTTACAGTACTGTCTCTCTACTA 

J TCTTCAGATTACTTGATTCAAATAAACCAATTATGTTTGTAATTGATATTAATAAAACCAGA 



ATAAAAGTTCATATCTACCC 
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FIGURE 260 

MI GYYL ILFLMWGSSTVFCVLL I FT I AEAS FS VENECLVDLCLLRI CYKLSGVPNQCRVPLP 
SDCSK 
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GAGGATTTGCCACAGCAGCGGATAGAGCAGGAGAGCACCACCGGAGCCCTTGAGACATCCTT 
\ GAGAAGAGCCACAGCATAAGAGACTGCCCTGCTTGGTGTTTTGCAGGATGATGGTGGCCCTT 
CGAGGAGCTTCTGCATTGCTGGTTCTGTTCCTTGCAGCTTTTCTGCCCCCGCCGCAGTGTAC 
CCAGGACCCAGCCATGGTGCATTACATCTACCAGCGCTTTCGAGTCTTGGAGCAAGGGCTGG 
^ AAAAATGTACCCAAGCAACGAGGGCATACATTCAAGAATTCCAAGAGTTCTCAAAAAATATA 
TCTGTCATGCTGGGAAGATGTCAGACCTACACAAGTGAGTACAAGAGTGCAGTGGGTAACTT 
GGCACTGAGAGTTGAACGTGCCCAACGGGAGATTGACTACATACAATACCTTCGAGAGGCTG 
v| ACGAGTGCATCGTATCAGAGGACAAGACACTGGCAGAAATGTTGCTCCAAGAAGCTGAAGAA 
H GAGAAAAAGATCCGGACTCTGCTGAATGCAAGCTGTGACAACATGCTGATGGGCATAAAGTC 
V ' TTTGAAAATAGTGAAGAAGATGATGGACACACATGGCTCTTGGATGAAAGATGCTGTCTATA 
ACTCTCCAAAGGTGTACTTATTAATTGGATCCAGAAACAACACTGTTTGGGAATTTGCAAAC 
ATACGGGCATTCATGGAGGATAACACCAAGCCAGCTCCCCGGAAGCAAATCCTAACACTTTC 
CTGGCAGGGAACAGGCCAAGTGATCTACAAAGGTTTTCTATTTTTTCATAACCAAGCAACTT 
CTAATGAGATAATCAAATATAACCTGCAGAAGAGGACTGTGGAAGATCGAATGCTGCTCCCA 
GGAGGGGTAGGCCGAGCATTGGTTTACCAGCACTCCCCCTCAACTTACATTGACCTGGCTGT 
; GGATGAGCATGGGCTCTGGGCCATCCACTCTGGGCCAGGCACCCATAGCCATTTGGTTCTCA 
CAAAGATTGAGCCGGGCACACTGGGAGTGGAGCATTCATGGGATACCCCATGCAGAAGCCAG 
GATGCTGAAGCCTCATTCCTCTTGTGTGGGGTTCTCTATGTGGTCTACAGTACTGGGGGCCA 
GGGCCCTCATCGCATCACCTGCATCTATGATCCACTGGGCACTATCAGTGAGGAGGACTTGC 
CCAACTTGTTCTTCCCCAAGAGACCAAGAAGTCACTCCATGATCCATTACAACCCCAGAGAT 
% . AAGCAGCTCTATGCCTGGAATGAAGGAAACCAGATCATTTACAAACTCCAGACAAAGAGAAA 
% GCTGCCTCTGAAG3fAATGCATTACAGCTGTGAGAAAGAGCACTGTGGCTTTGGCAGCTGTTC 
'& TACAGGACAGTGAGGCTATAGCCCCTTCACAATATAGTATCCCTCTAATCACACACAGGAAG 
AGTGTGTAGAAGTGGAAATACGTATGCCTCCTTTCCCAAATGTCACTGCCTTAGGTATCTTC 
CAAGAGCTTAGATGAGAGCATATCATCAGGAAAGTTTCAACAATGTCCATTACTCCCCCAAA 
CCTCCTGGCTCTCAAGGATGACCACATTCTGATACAGCCTACTTCAAGCCTTTTGTTTTACT 
GCTG CCCAGCATTTACTGTAACTCTGCCATCTTCCCTCCCACAATTAGAGTTGTATGCCAGC 
CCCTAATATTCACCACTGGCTTTTCTCTCCCCTGGCCTTTGCTGAAGCTCTTCCCTCTTTTT 
CAAATGTCTATTGATATTCTCCCATTTTCACTGCCCAACTAAAATACTATTAATATTTCTTT 
CTTTTCTTTTCTTTTTTTTGAGACAAGGTCTCACTATGTTGCCCAGGCTGGTCTCAAACTCC 
AGAGCTCAAGAGATCCTCCTGCCTCAGCCTCCTAAGTACCTGGGATTACAGGCATGTGCCAC 
CACACCTGGCTTAAAATACTATTTCTTATTGAGGTTTAACCTCTATTTCCCCTAGCCCTGTC 
CTTCCACTAAGCTTGGTAGATGTAATAATAAAGTGAAAATATTAACATTTGAATATCGCTTT 
CCAGGTGTGGAGTGTTTGCACATCATTGAATTCTCGTTTCACCTTTGTGAAACATGCACAAG 
TCTTTACAGCTGTCATTCTAGAGTTTAGGTGAGTAACACAATTACAAAGTGAAAGATACAGC 
TAGAAAATACTACAAATCCCATAGTTTTTCCATTGCCCAAGGAAGCATCAAATACGTATGTT 
TGTTCACCTACTCTTATAGTCAATGCGTTCATCGTTTCAGCCTAAAAATAATAGTCTGTCCC 
TTTAGCCAGTTTTCATGTCTGCACAAGACCTTTCAATAGGCCTTTCAAATGATAATTCCTCC 
$ AGAAAACCAGTCTAAGGGTGAGGACCCCAACTCTAGCCTCCTCTTGTCTTGC'I'G TCCTCTGT 

TTCTCTCTTTCTGCTTTAAATTCAATAAAAGTGACACTGAGCAAAAAAAAAAAAAAA 
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MMVALRGASALLVLFLAAFLPPPQCTQDPAMVHYIYQRFRVLEQGLEKCTQATRAYIQEFQE 
FSKNI S VMLGRCQT YTSE YKS AVGNLALRVERAQRE I D Y I QYLREADEC I VS ED KTLAEMLL 
QEAEEEKKIRTLLNAS CDNMLMG I KS LK I VKKMMDTHGSWMKDAVYNS PKVYLL I GSRNNTV 

" <; WEFANIRAFMEDNTKPAPRKQILTLSWQGTGQVIYKGFLFFHNQATSNEI IKYNLQKRTVED 

RMLLPGGVGRALVYQHSPSTYIDLAVDEHGLWAIHSGPGTHSHLVLTKIEPGTLGVEHSWDT 

§ PCRSQDAEASFLLCGVLYWYSTGGQGPHRITCIYDPLGTISEEDLPNLFFPKRPRSHSMIH 
YNPRDKQLYAWNEGNQIIYKLQTKRKLPLK 
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FIGURE 263 

GGGCGCCCGCGTACTCACTAGCTGAGGTGGCAGTGGTTCCACCAACATGGAGCTCTCGCAGA 
TGTCGGAGCTCATGGGGCTGTCGGTGTTGCTTGGGCTGCTGGCCCTGATGGCGACGGCGGCG 
GTAGCGCGGGGGTGGCTGCGCGCGGGGGAGGAGAGGAGCGGCCGGCCCGCCTGCCAAAAAGC 
AAATGGATTTCCACCTGACAAATCTTCGGGATCCAAGAAGCAGAAACAATATCAGCGGATTC 
"\ GGAAGGAGAAGCCTCAACAACACAACTTCACCCACCGCCTCCTGGCTGCAGCTCTGAAGAGC 
CACAGCGGGAACATATCTTGCATGGACTTTAGCAGCAATGGCAAATACCTGGCTACCTGTGC 
j| AGATGATCGCACCATCCGCATCTGGAGCACCAAGGACTTCCTGCAGCGAGAGCACCGCAGCA 
| TGAGAGCCAACGTGGAGCTGGACCACGCCACCCTGGTGCGCTTCAGCCCTGACTGCAGAGCC 
TTCATCGTCTGGCTGGCCAACGGGGACACCCTCCGTGTCTTCAAGATGACCAAGCGGGAGGA 
TGGGGGCTACACCTTCACAGCCACCCCAGAGGACTTCCCTAAAAAGCACAAGGCGCCTGTCA 
TCGACATTGGCATTGCTAACACAGGGAAGTTTATCATGACTGCCTCCAGTGACACCACTGTC 
CTCATCTGGAGCCTGAAGGGTCAAGTGCTGTCTACCATCAACACCAACCAGATGAACAACAC 
ACACGCTGCTGTATCTCCCTGTGGCAGATTTGTAGCCTCGTGTGGCTTCACCCCAGATGTGA 
AGGTTTGGGAAGTCTGCTTTGGAAAGAAGGGGGAGTTCCAGGAGGTGGTGCGAGCCTTCGAA 
CTAAAGGGCCACTCCGCGGCTGTGCACTCGTTTGCTTTCTCCAACGACTCACGGAGGATGGC 
TTCTGTCTCCAAGGATGGTACATGGAAACTGTGGGACACAGATGTGGAATACAAGAAGAAGC 
AGGACCCCTACTTGCTGAAGACAGGCCGCTTTGAAGAGGCGGCGGGTGCCGCGCCGTGCCGC 
CTGGCCCTCTCCCCCAACGCCCAGGTCTTGGCCTTGGCCAGTGGCAGTAGTATTCATCTCTA 
g CAATACCCGGCGGGGCGAGAAGGAGGAGTGCTTTGAGCGGGTCCATGGCGAGTGTATCGCCA 
% ACTTGTCCTTTGACATCACTGGCCGCTTTCTGGCCTCCTGTGGGGACCGGGCGGTGCGGCTG 
^ TTTCACAACACTCCTGGCCACCGAGCCATGGTGGAGGAGATGCAGGGCCACCTGAAGCGGGC 
CTCCAACGAGAGCACCCGCCAGAGGCTGCAGCAGCAGCTGACCCAGGCCCAAGAGACCCTGA 
AGAGCCTGGGTGCCCTGAAGAAGTGACTCTGGGAGGGCCCGGCGCAGAGGATTGAGGAGGAG 
GGATCTGGCCTCCTCATGGCACTGCTGCCATCTTTCCTCCCAGGTGGAAGCCTTTCAGAAGG 
AGTCTCCTGGTTTTCTTACTGGTGGCCCTGCTTCTTCCCATTGAAACTACTCTTGTCTACTT 
AGGTCTCTCTCTTCTTGCTGGCTGTGACTCCTCCCTGACTAGTGGCCAAGGTGCTTTTCTTC 
CTCCCAGGCCCAGTGGGTGGAATCTGTCCCCACCTGGCACTGAGGAGAATGGTAGAGAGGAG 
AGGAGAGAGAGAGAGAATGTGATTTTTGGCCTTGTGGCAGCACATCCTCACACCCAAAGAAG 
TTTGTAAATGTTCCAGAACAACCTAGAGAACACCTGAGTACTAAGCAGCAGTTTTGCAAGGA 
TGGGAGACTGGGATAGCTTCCCATCACAGAACTGTGTTCCATCAAAAAGACACTAAGGGATT 
TCCTTCTGGGCGTCAGTTCTATTTGTAAGATGGAGAATAATCCTCTCTGTGAACTCCTTGCA 
AAGATGATATGAGGCTAAGAGAATATCAAGTCCCCAGGTCTGGAAGAAAAGTAGAAAAGAGT 
AGTACTATTGTCCAATGTCATGAAAGTGGTAAAAGTGGGAACCAGTGTGCTTTGAAACCAAA 
TTAGAAACACATTCCTTGGGAAGGCAAAGTTTTCTGGGACTTGATCATACATTTTATATGGT 
^ TGGGACTTCTCTCTTCGGGAGATGATATCTTGTTTAAGGAGACCTCTTTTCAGTTCATCAAG 
TTCATCAGATATTTGAGTGCCCACTCTGTGCCCAAATAAATATGAGCTGGGGATTAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 264 

MELSQMSELMGLSVLLQLLAIjMATAAVARGWLRAGEERSGRPACQKANGFPPDKSSGSKKQK 
QYQR I RKEKPQQHNFTHRLLAAALKSHSGN I S CMDFS SNGK YLATCADDRT I R I WS TKDFLQ 
REHRSMRANVELDHATLVRFSPDCRAFIVWLANGDTLRVFKMTKREDGGYTFTATPEDFPKK 
HKAPVIDIGIANTGKFIMTASSDTTVLIWSLKGQVLSTINTNQMNNTHAAVSPCGRFVASCG 
FTPDVKVWEVCFGKKGEFQEWRAFELKGHSAAVHSFAFSNDSRRMASVSKDGTWKLWDTDV 
EYKKKQDPYLLKTGRFEEAAGAAPCRLALSPNAQVLALASGSSIHLYNTRRGEKEECFERVH 
GECIANLSFDITGRFLASCGDRAVRLFHNTPGHRAMVEEMQGHLKRASNESTRQRLQQQLTQ 
AQETLKSLGALKK 
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FIGURE 265 

TGGCCTCCCCAGCTTGCCAGGCACAAGGCTGAGCGGGAGGAAGCGAGAGGCATCTAAGCAGG 
CAGTGTTTTGCCTTCACCCCAAGTGACCATGAGAGGTGCCACG CG AGTCTCAATCATGCTCC 
TCCTAGTAACTGTGTCTGACTGTGCTGTGATCACAGGGGCCTGTGAGCGGGATGTCCAGTGT 
GGGGCAGGCACCTGCTGTGCCATCAGCCTGTGGCTTCGAGGGCTGCGGATGTGCACCCCGCT 
| GGGGCGGGAAGGCGAGGAGTGCCACCCCGGCAGCCACAAGGTCCCCTTCTTCAGGAAACGCA 
^ AGCACCACACCTGTCCTTGCTTGCCCAACCTGCTGTGCTCCAGGTTCCCGGACGGCAGGTAC 
CGCTGCTCCATGGACTTGAAGAACATCAATTTTTAGGCGCTTGCCTGGTCTCAGGATACCCA 
CCATCCTTTTCCTGAGCACAGCCTGGATTTT^^TTTCTGCCATGAAACCCAGCTCCCATGAC 
TCTCCCAGTCCCTACACTGACTACCCTGATCTCTCTTGTCTAGTACGCACATATGCACACAG 
GCAGACATACCTCCCATCATGACATGGTCCCCAGGCTGGCCTGAGGATGTCACAGCTTGAGG 
CTGTGGTGTGAAAGGTGGCCAGCCTGGTTCTCTTCCCTGCTCAGGCTGCCAGAGAGGTGGTA 
AATGGCAGAAAGGACATTCCCCCTCCCCTCCCCAGGTGACCTGCTCTCTTTCCTGGGCCCTG 
& CCCCTCTCCCCACATGTATCCCTCGGTCTGAATTAGACATTCCTGGGCACAGGCTCTTGGGT 
^ GCATTGCTCAGAGTCCCAGGTCCTGGCCTGACCCTCAGGCCCTTCACGTGAGGTCTGTGAGG 
ACCAATTTGTGGGTAGTTCATCTTCCCTCGATTGGTTAACTCCTTAGTTTCAGACCACAGAC 
TCAAGATTGGCTCTTCCCAGAGGGCAGCAGACAGTCACCCCAAGGCAGGTGTAGGGAGCCCA 
GGGAGGCCAATCAGCCCCCTGAAGACTCTGGTCCCAGTCAGCCTGTGGCTTGTGGCCTGTGA 
CCTGTGACCTTCTGCCAGAATTGTCATGCCTCTGAGGCCCCCTCTTACCACACTTTACCAGT 
TAACCACTGAAGCCCCCAATTCCCACAGCTTTTCCATTAAAATGCAAATGGTGGTGGTTCAA 
TCTAATCTGATATTGACATATTAGAAGGCAATTAGGGTGTTTCCTTAAACAACTCCTTTCCA 
AGGATCAGCCCTGAGAGCAGGTTGGTGACTTTGAGGAGGGCAGTCCTCTGTCCAGATTGGGG 
TGGGAGCAAGGGACAGGGAGCAGGGCAGGGGCTGAAAGGGGCACTGATTCAGACCAGGGAGG 
CAACT ACAC ACCAACATGCTGGCTTTAGAATAAAAG CAC CAA CTG AAAAAA 
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MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGREGEECHP 
GSHKVPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF 
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AGCGCCCGGGCGTCGGGGCGGTAAAAGGCCGGCAGAAGGGAGGCACTTGAGAAATGTCTTTC 
CTCCAGGACCCAAGTTTCTTCACCATGGGGATGTGGTCCATTGGTGCAGGAGCCCTGGGGGC 
TGCTGCCTTGGCATTGCTGCTTGCCAACACAGACGTGTTTCTGTCCAAGCCCCAGAAAGCGG 
CCCTGGAGTACCTGGAGGATATAGACCTGAAAACACTGGAGAAGGAACCAAGGACTTTCAAA 
GCAAAGGAGCTATGGGAAAAAAATGGAGCTGTGATTATGGCCGTGCGGAGGCCAGGCTGTTT 
CCTCTGTCGAGAGGAAGCTGCGGATCTGTCCTCCCTGAAAAGCATGTTGGACCAGCTGGGCG 
TCCCCCTCTATGCAGTGGTAAAGGAGCACATCAGGACTGAAGTGAAGGATTTCCAGCCTTAT 
TTCAAAGGAGAAATCTTCCTGGATGAAAAGAAAAAGTTCTATGGTCCACAAAGGCGGAAGAT 
GATGTTTATGGGATTTATCCGTCTGGGAGTGTGGTACAACTTCTTCCGAGCCTGGAACGGAG 
GCTTCTCTGGAAACCTGGAAGGAGAAGGCTTCATCCTTGGGGGAGTTTTCGTGGTGGGATCA 
GGAAAGCAGGGCATTCTTCTTGAGCACCGAGAAAAAGAATTTGGAGACAAAGTAAACCTACT 
TTCTGTTCTGGAAGCTGCTAAGATGATCAAACCACAGACTTTGGCCTCAGAGAAAAAATGAT 
TGTGTGAAACTGCCCAGCTCAGGGATAACCAGGGACATTCACCTGTGTTCATGGGATGTATT 
GTTTCCACTCGTGTCCCTAAGGAGTGAGAAACCCATTTATACTCTACTCTCAGTATGGATTA 
TTAATGTATTTTAATATTCTGTTTAGGCCCACTAAGGCAAAATAGCCCCAAAACAAGACTGA 
CAAAAATCTGAAAAACTAATGAGGATTATTAAGCTAAAACCTGGGAAATAGGAGGCTTAAAA 
TTGACTGCCAGGCTGGGTGCAGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCAAGG 
TGAGCAAGTCACTTGAGGTCGGGAGTTCGAGACCAGCCTGAGCAACATGGCGAAACCCCGTC 
TCTACTAAAAAT^/JAAAAATCACCCGGGTGTGGTGGCAGGCACCTGTAGTCCCAGCTACCCG 
GGAGGCTGAGGCAGGAGAATCACTTGAACCTGGGAGGTGGAGGTTGCGGTGAGCTGAGATCA 
CACCACTGTATTCCAGCCTGGGTGACTGAGACTCTAACTAA 
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MS FL.QDPS FFTMGMWS I GAGALGAAAIjALLLiANTDVFLS KPQKAALE YLED I DLKTLEKE PR 
TFKAKELWEKNGAVIMAVRRPGCFLCREEAADLSSLKSMLDQLGVPLYAWKEHIRTEVKDF 
QP YFKGE I FLDEKKKFYGPQRRKMMFMGF I RLGVWYNFFRAWNGGFSGNLEGEGF ILGGVFV 
VGSGKQGILLEHREKEFGDKWLLSVLEAAKMIKPQTLASEKK 
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FIGURE 269 

ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCG 
GG C C AGGTGCC C CGTCG C AGGTGC C C CTGG C C GGAG AT GCGGTAGG AG GGGCGAGCGC GAG A 
AGCCCCTTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTG 
CTTCTGGCGCTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACA 
GACCACTTCTGCAAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATG 
GCAACCTGCGTCCGGAAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTG 
CTCCTGGCTGTGGGGCTGGCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGG 
CACCTACCGGCCCAGTAGCGAGGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGG 
ACTCCAAGGAGACGGTGCAGGGCTGCCTGCCGATC TAG GTCCGGTCTCCTGCATCTGTCTCC 
CTTCATTGCTGTGTGACCTTGGGGAAAGGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAG 
TGCTTAATAGCAGGGAAGAAGGTACTTCAAAGACTCTGCCCCTGAGGTCAAGAGAGGATGGG 
GCTATTCACTTTTATATATTTATATAAAATTAGTAGTGAGATGTAAAAAAAAAAAAAAAAAA 
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MANPGLGLLLALGLPFLLARWGRAWGQIQTTSANENSTVIjPSSTSSSSDGNLRPEAITAIIV 
VFSLLAMjLLAVGIJVLLVRK^^ 
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AATATATCATCTATTTATCATTAATCAATAATGTATTCTTTTATTCCAATAACATTTGGGTT 
TTGGGATTTTAATTTTCAAACACAGCAGAATGACATTTTTTCTGTCACTATTATTATTGTTG 
GTATGTGAAGCTATTTGGAGATCCAATTCAGGAAGCAACACATTGGAGAATGGCTACTTTCT 
ATCAAGAAATAAAGAGAACCACAGTCAACCCACACAATCATCTTTAGAAGACAGTGTGACTC 
CTACCAAAGCTGTCAAAACCACAGGCAAGGGCATAGTTAAAGGACGGAATCTTGACTCAAGA 
GGGTTAATTCTTGGTGCTGAAGCCTGGGGCAGGGGTGTAAAGAAAAACACTTAGATTCAATG 
ATTGTAAATTTAAGGCAAATACACATATTAGTATTACCTTAGTGTAATGTATGCCTGTCATA 
TATACAATAAGGTGAAATTATAAGTACCCTATGCAGTTGGCTGGACAGTTCTAAATTGGACT 
TTATTAATTTTTAAAATCAGTAACTGATTTATCACTGGCTATGTGCTTAGATCTACAGGAGA 
TCATATAATTTGATACAAATAAAAGAAAAGTGTTCTCTCCCCTTACAGAATTGACATTTTAA 
ATGCGATACAGTTAGAATAGGAAATATGACATTAGAAAGGAAGAATGACAGGGAGAAAGGAA 
AGAAGGGAAAATGTTGCCAAGGAAAAAAAAA 
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MTFFLSLLLLLVCEAIWRSNSGSNTLENGYFLSRNKENHSQPTQSSLEDSVTPTKAVKTTGK 
G I VKGRNLDS RGL I LGAEAWGRGVKKNT 
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GCCAGGAATAACTAGAGAGGAACAATGGGGTTATTCAGAGGTTTTGTTTTCCTCTTAGTTCT 
GTGCCTGCTGCACCAGTCAAATACTTCCTTCATTAAGCTGAATAATAATGGCTTTGAAGATA 
TTGTCATTGTTATAGATCCTAGTGTGCCAGAAGATGAAAAAATAATTGAACAAATAGAGGAT 
ATGGTGACTACAGCTTCTACGTACCTGTTTGAAGCCACAGAAAAAAGATTTTTTTTCAAAAA 
TGTATCTATATTAATTCCTGAGAATTGGAAGGAAAATCCTCAGTACAAAAGGCCAAAACATG 

- AAAACCATAAACATGCTGATGTTATAGTTGCACCAC CTACACTCCCAGGTAGAGATGAACCA 
TACACCAAGCAGTTCACAGAATGTGGAGAGAAAGGCGAATACATTCACTTCACCCCTGACCT 
TCTACTTGGAAAAAAACAAAATGAATATGGACCAC CAGG CAAACTGTTTGT CCATGAGTGGG 

r v CTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTCTACCGTGCTAAG 
TCAAAAAAAATCGAAGCAACAAGGTGTTCCGCAGGTATCTCTGGTAGAAATAGAGTTTATAA 
GTGTCAAGGAGGCAGCTGTCTTAGTAGAGCATGCAGAATTGATTCTACAACAAAACTGTATG 
GAAAAGATTGTCAATTCTTTCCTGATAAAGTACAAACAGAAAAAGCATCCATAATGTTTATG 
CAAAGTATTGATTCTGTTGTTGAATTTTGTAACGAAAAAACCCATAATCAAGAAGCTCCAAG 
CCTACAAAACATAAAGTGCAATTTTAGAAGTACATGGGAGGTGATTAGCAATTCTGAGGATT 
TTAAAAACACCATACCCATGGTGACACCACCTCCTCCACCTGTCTTCTCATTGCTGAAGATC 
AGTCAAAGAATTGTGTGCTTAGTTCTTGATAAGTCTGGAAGCATGGGGGGTAAGGACCGCCT 
AAATCGAATGAATCAAGCAGCAAAACATTTCCTGCTGCAGACTGTTGAAAATGGATCCTGGG 
TGGGGATGGTTCACTTTGATAGTACTGCCACTATTGTAAATAAGCTAATCCAAATAAAAAGC 
AGTGATGAAAGAAACACACTCA^CCCAGGATTACCTACATATCCTCTGGGAGGAACTTCCAT 
CTGCTCTGGAATTAAATATGCATTTCAGGTGATTGGAGAGCTACATTCCCAACTCGATGGAT 
CCGAAGTACTGCTGCTGACTGATGGGGAGGATAACACTGCAAGTTCTTGTATTGATGAAGTG 
AAACAAAGTGGGGCCATTGTTCATTTTATTGCTTTGGGAAGAGCTGCTGATGAAGCAGTAAT 
AGAGATGAGCAAGATAACAGGAGGAAGTCATTTTTATGTTTCAGATGAAGCTCAGAACAATG 

vf GCCTCATTGATGCTTTTGGGGCTCTTACATCAGGAAATACTGATCTCTCCCAGAAGTCCCTT 

- CAGCTCGAAAGTAAGGGATTAACACTGAATAGTAATGCCTGGATGAACGACACTGTCATAAT 
TGATAGTACAGTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTA 
TTTCTCTCTGGGATCCCAGTGGAACAATAATGGAAAATTTCACAGTGGATGCAACTTCCAAA 
ATGGCCTATCTCAGTATTCCAGGAACTGCAAAGGTGGGCACTTGGGCATACAATCTTCAAGC 
CAAAGCGAACCCAGAAACATTAACTATTACAGTAACTTCTCGAGCAGCAAATTCTTCTGTGC 
CTCCAATCACAGTGAATGCTAAAATGAATAAGGACGTAAACAGTTTCCCCAGCCCAATGATT 
GTTTACGCAGAAATTCTACAAGGATATGTACCTGTTCTTGGAGCCAATGTGACTGCTTTCAT 
TGAATCACAGAATGGACATACAGAAGTTTTGGAACTTTTGGATAATGGTGCAGGCGCTGATT 
CTTTCAAGAATGATGGAGTCTACTCCAGGTATTTTACAGCATATACAGAAAATGGCAGATAT 
AGCTTAAAAGTTCGGGCTCATGGAGGAGCAAACACTGCCAGGCTAAAATTACGGCCTCCACT 
GAATAGAGCCGCGTACATACCAGGCTGGGTAGTGAACGGGGAAATTGAAGCAAACCCGCCAA 
GA CCTGAAATTGATGAGGATACTCAGACC AC CTTGGAGG ATTT CAGC CGAACAGC AT C C GG A 
GGTGCATTTGTGGTATCACAAGTCCCAAGCCTTCCCTTGCCTGACCAATACCCACCAAGTCA 
AATCACAGACCTTGATGCCACAGTTCATGAGGATAAGATTATTCTTACATGGACAGCACCAG 
GAGATAATTTTGATGTTGGAAAAGTTCAACGTTATATCATAAGAATAAGTGCAAGTATTCTT 
GATCTAAGAGACAGTTTTGATGATGCTCTTCAAGTAAATACTACTGATCTCTCACCAAAGGA 
GGCCAACTCCAAGGAAAGCTTTGCATTTAAACCAGAAAATATCTCAGAAGAAAATGCAACCC 
ACATATTTATTGCCATTAAAAGTATAGATAAAAGCAATTTGACATCAAAAGTATCCAACATT 

^ GCACAAGTAACT^TGTTTATCCCTCAAGCAAATCCTGATGACATTGATCCTACACCTACTCC 
TACTCCTACTCCTACTCCTGATAAAAGTCATAATTCTGGAGTTAATATTTCTACGCTGGTAT 
TGTCTGTGATTGGGTCTGTTGTAATTGTTAACTTTATTTTAAGTACCACCATTTGAACCTTA 
ACGAAGAAAAAAATCTTCAAGTAGACCTAGAAGAGAGTTTTAAAAAACAAAACAATGTAAGT 
AAAGGATATTTCTGAATCTTAAAATTCATCCCATGTGTGATCATAAACTCATAAAAATAATT 
TTAAGATGTCGGAAAAGGATACTTTGATTAAATAAAAACACTCATGGATATGTAAAAACTGT 
CAAGATTAAAATTTAATAGTTTCATTTATTTGTTATTTTATTTGTAAGAAATAGTGATGAAC 
AAAGATCCTTTTTCATACTGATACCTGGTTGTATATTATTTGATGCAACAGTTTTCTGAAAT 
GATATTTCAAATTGCATCAAGAAATTAAAATCATCTATCTGAGTAGTCAAAATACAAGTAAA 
GG AG AGC AAAT AAAC AAC ATT TG G AAAAAAAAA AAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTY 
LFEATEKRFFFKJWSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTEC 
GEKGEYIHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATR 
CSAGISGRNRVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQSIDSWE 
FCNEKTHNQEAPSLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLV 
LDKSGSMGGKDRLNRMNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLM 
AGLPTYPLGGTSICSGIKYAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVH 
FIALGRAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLT 
LNSNAWMNDTVIIDSTVGKDTFFLITV^SLPPSISLWDPSGTIMENFTVDATSKMAYLSIPG 
TAKVGTWA YNLQ AKANPETLT I TVTS RAANS S VP P I TVNAKMNKDVNS FPSPMI VYAE I LQG 
YVPVLGANVTAFIESQNGHTEVLELLDNGAGADSFKNDGVYSRYFTAYTENGRYSLKVRAHG 
GANTARLKLRP PLNRAAYI PGWWNGE I EANPPRPE I DEDTQTTLEDFSRTASGGAF WS QV 
PSLPLPDQYPPSQITDLDATVHEDKI ILTWTAPGDNFDVGKVQRYI IRISASILDLRDSFDD 
ALQVNTTDLS PKEANS KES FAFKPEN I SEENATH I F I A I KS I DKSNLTSKVSNIAQVTLFI P 
QANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWIVNFILSTTI 
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CTCCTTAGGTGGAAACCCTGGGAGTAGAGTACTGACAGCAAAGACCGGGAAAGACCATACGTCCCCGG 

GCAGGGGTGACAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGAC 

GCCAAGGTAATTTTGACCCAGAGGAGCAATGATGTAGCCACCTCCTAACCTTCCCTTCTTGAACCCCC 

AGTTATGCCAGGATTTACTAGAGAGTGTCAA.CTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGG 

TTGGAGGAGAGAACCTTTGTGGGGCTGCGXTCTCTTAGCAGTGCTCAGAAGTGACTTGCCTGAGGGTG 

GACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAAGGCTGTGATGGGAATGAA 

GGTGAAAACTTGGAGATTTCACTTCAGTCATTGCTTCTGCCTGCAAGATCATCCTTTAAAAGTAGAGA 

AGCTGCTCTGTGTGGTGGTTAACTCCAAGAGGCAGAACTCGTTCTAGAAGGAAATGGATGCAAGCAGC 

TCCGGGGGCCCCAAACGC^TGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCT 

TTGAGGGATGCCACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCT 

TGCGTGGATTTCCCGGGTGGTGGTTTTGCTGGTGCTCCTCT'jCTGTGCTATCTCTGTCCTGTACATGT 

TGGCCTGCACCCCAAAAGGTGACGAGGAGCAGCTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAG 

GAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCACCGCAACTACGTGAGCAGCCTGAAGCG 

GCAGATCGCACAGCTCAAGGAGGAGCTGCAGGAGAGGAGTGAGCAGCTCAGGAATGGGCAGTACCAAG 

CCAGCGATGCTGCTGGCCTGGGTCTGGACA " , iAGCCCCCCAGAGAAAACCCAGGCCGACCTCCTGGCC 

TTCCTGCACTCGCAGGTGGACAAGGCAGAGGTGAATGCTGGCGTCAAGCTGGCCACAGAGTATGCAGC 

AGTGCCTTTCGATAGCTTTACTCTACAGAAGGTGTACCAGCTGGAGACTGGCCTTACCCGCCACGCCG 

AGGAGAAGCCTGTGAGGAAGGACAAGCGGGATGAGTTGGTGGAAGCCATTGAATCAGCCTTGGAGACC 

CTGAACAATCCTGCAGAGAACAGCCCCAATCACCGTCCTTACACGGCCTCTGATTTCATAGAAGGGAT 

CTACCGAACAGAAAGGGACAAAGGGACATTGTATGAGCTCACCTTCAAAGGGGACCACAAACACGAAT 

TCAAACGGCTCATCTTATTTCGACCATTCAGCCCCATCATGAAAGTGAAAAATGAAAAGCTCAACATG 

GCCAACACGCTTATCAATGTTATCGTGCCTCTAGCAAAAAGGGTGGACAAGTTCCGGCAGTTCATGCA 

GAATTTCAGGGAGATGTGCATTGAGCAGGATGGGAGAGT C CAT CT C ACTGTTGTTTACTTTGGGAAAG 

AAGAAAT AAATGAAGT CAAAGGAATACTTGAAAACACTT C C AAAG CTG C CAACT T C AGGAACTTTAC C 

TTCATCCAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGG 

AAGCAACGTCCTTCTCTTTTTCTGTGATGTGGA CATC T ACT TCACATCTGAATTCCTCAATACGTGTA 

GGCTGAATACACAGCCAGGGAAGAAGGTATTTTATCCAGTTCTTTTCAGTCAGTACAATCCTGGCATA 

ATATACGGCCACCATGATGCAGTCCCTCCCTTGGAACAGCAGCTGGTCATAAAGAAGGAAACTGGATT 

TTGGAGAGACTTTGGATTTGGGATGACGTGTCAGTATCGGTCAGACTTCATCAATATAGGTGGGTTTG 

ATCTGGACATCAAAGGCTGGGGCGGAGAGGATGTGCACCTTTATCGCAAGTATCTCCACAGCAACCTC 

ATAGTGGTACGGACGCCTGTGCGAGGACTCTTCCACCTCTGGCATGAGAAGCGCTGCATGGACGAGCT 

GACCCCCGAGCAGTACAAGATGTGCATGCAGTCCAAGGCCATGAACGAGGCATCCCACGGCCAGCTGG 

GCATGCTGGTGTTCAGGCACGAGATAGAGGCT CACCTT CG CAAACAGAAACAGAAGACAAGTAGCAAA 

AAAACATGAACTCCCAGAGAAGGATTGTGGGAGACACTTTTTCTTTCCTTTTGCAATTACTGAAAGTG 

GCTGCAACAGAGAAAAGACTTCCATAAAGGACGACAAAAGAATTGGACTGATGGGTCAGAGATGAGAA 

AGCCTCCGATTTCTCTCTGTTGGGCTTTTTACAACAGAAATCAAAATCTCCGCTTTGCCTGCAAAAGT 

AACCCAGTTGCACCCTGTGAAGTGTCTGACAAAGGCAGAATGCTTGTGAGATTATAAGCCTAATGGTG 

TGGAGGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGAAATATTCATG 

ATTTAAGAGCAGTTTTGTAAAAAATTCATTAGCATGAAAGGCAAGCATATTTCTCCTCATATGAATGA 

GCCTATCAGCAGGGCTCTAGTTTCTAGGAATGCTAAAATATCAGAAGGCAGGAGAGGAGATAGGCTTA 

TTATGATACTAGTGAGTACATTAAGTAAAATAAAATGGACCAGAAAAGAAAAGAAACCATAAATATCG 

TGTCATATTTTCCCCAAGATTAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCT 

CCGTTTTTTTCTTTTATTTAAAAATGCACTTTTTTTCCCTTGTGAGTTATAGTCTGCTTATTTAATTA 

CCACTTTGCAAGCCTTACAAGAGAGCACAAGTTGGCCTACATTTTTATATTTTTTAAGAAGATACTTT 

GAGATGCATTATGAGAACTTTCAGTTCAAAGCATCAAATTGATGCCATATCCAAGGACATGCCAAATG 

CTGATTCTGTCAGGCACTGAATGTCAGGCATTGAGACATAGGGAAGGAATGGTTTGTACTAATACAGA 

CGTACAGATACTTTCTCTGAAGAGTATTTTCGAAGAGGAGCAACTGAACACTGGAGC.^iAAGAAAATG 

ACACTTTCTGCTTTACAGAAAAGGAAACTCATTCAGACTGGTGATATCGTGATGTACCTAAAAGTCAG 

AAACCACATTTTCTCCTCAGAAGTAGGGACCGCTTTCTTACCTGTTTAAATAAACCAAAGTATACCGT 

GTGAACCAAACAATCTCTTTTCAAAACAGGGTGCTCCTCCTGGCTTCTGGCTTCCATAAGAAGAAATG 

GAGAAAAATATATATATATATATATATATTGTGAAAGATCAATCCATCTGCCAGAATCTAGTGGGATG 

GAAGTTTTTGCTACATGTTATCCACCCCAGGCCAGGTGGAAGTAACTGAATTATTTTTTAAATTAAGC 

AGTTCTACTCAATCACCAAGATGCTTCTGAAAATTGCATTTTATTACCATTTCAAACTATTTTTTAAA 

AATAAATACAGTTAACATAGAGTGGTTTCTTCATTCATGTGAAAATTATTAGCCAGCACCAGATGCAT 

GAGCTAATTATCTCTTTGAGTCCTTGCTTCTGTTTGCTCACAGTAAACTCATTGTTTAAAAGCTTCAA 

GAACATTCAAGCTGTTGGTGTGTTAAAAAATGCATTGTATTGATTTGTACTGGTAGTTTATGAAATTT 

AATTAAAACACAGGCCATGAATGGAAGGTGGTATTG CAC AG CTAATAAAATATGATTTGTGGATATGAA 
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MMMVRRGLLAWISRVVVLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQ 
EWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFL 
HS Q VDKAE VNAG VKLATE YAAVPFDS FTLQKVYQLE TGLTRH P EE KP VRKD KRDE L VEA I E S 
ALETLNNPAENS PNHRP YT AS DF I EG I YRTERDKGTL YELTFKGDHKHE FKRL I IjFRPF S P I 
MKVKNEKLNMANTLINVIVPIA.KRVDKFRQFMQNFREMCIEQDGRVHLTVVYFGKEEINEVK 
GILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCR 
LiNTQPGKKVFYPVLFSQYNPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFI 
NIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQS 
KAMNEASHGQLGMLVFRHE I EAHLRKQKQKTS S KKT 



BNSDOCID: <WO._9963088A2 ..!.. > 



■Pi 



WO 99/63088 _ . / PCT/US99/12252 

FIGURE 277 

GAAAGAATGTTGTGGCTGCTCTTTTTTCTGGTGACTGCCATTCATGCTGAACTCTGTCAACC 
AGGTGCAGAAAATGCTTTTAAAGTGAGACTTAGTATCAGAACAGCTCTGGGAGATAAAGCAT 
ATGCCTGGGATACCAATGAAGAATACCTCTTCAAAGCGATGGTAGCTTTCTCCATGAGAAAA 
GTTCCCAACAGAGAAGCAACAGAAATTTCCCATGTCCTACTTTGCAATGTAACCCAGAGGGT 
ATCATTCTGGTTTGTGGTTACAGACCCTTCAAAAAATCACACCCTTCCTGCTGTTGAGGTGC 
AATCAGCCATAAGAATGAACAAGAACCGGATCAACAATGCCTTCTTTCTAAATGACCAAACT 
CTGGAATTTTTAAAAATCCCTTCCACACTTGCACCACCCATGGACCCATCTGTGCCCATCTG 
' l GATTATTATATTTGGTGTGATATTTTGCATCATCATAGTTGCAATTGCACTACTGATTTTAT 
CAGGGATCTGGCAACGTAGAAGAAAGAACAAAGAACCATCTGAAGTGGATGACGCTGAAGAT 
AAGTGTGAAAACATGATCACAATTGAAAATGGCATCCCCTCTGATCCCCTGGACATGAAGGG 
GGGCATATTAATGATGCCTTCATGACAGAGGATGAGAGGCTCACCCCTCTCTGAAGGGCTGT 
TGTTCTGGTTCCTCAAGAAATTAAACATTTGTTTCTGTGTGACTGCTGAGCATCCTGAAATA 
I! CCAAGAGCAGATCATATATTTTGTTTCACCATTCTTCTTTTGTAATAAATTTTGAATGTGCT 
m TGAAAGTGAAAAGCAATCAATTATACCCACCAACACCACTGAAATCATAAGCTATTCACGAC 
TCAAAATATTCTAAAATATTTTTCTGACAGTATAGTGTATAAATGTGGTCATGTGGTATTTG 
TAGTTATTGATTTAAGCATTTTTAGAAATAAGATCAGGCATATGTATATATTTTCACACTTC 
AAAGACCTAAGGAAAAATAAATTTTCCAGTGGAGAATACATATAATATGGTGTAGAAATCAT 
TGAAAATGGATCCTTTTTGACGATCACTTATATCACTCTGTATATGACTAAGTAAACAAAAG 
TGAGAAGTAATTATTGTAAATGGATG GAT AAAAATGGAAT TACT CAT ATA CAGGGTGGAATT 
TTATCCTGTTATCACACCAACAGTTGATTATATATTTTCTGAATATCAGCCCCTAATAGGAC 
AATTCTATTTGTTGACCATTTCTACAATTTGTAAAAGTCCAATCTGTGCTAACTTAATAAAG 
TAATAATCATCTCTTTTTAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MLWLLFFLVTAIHAELCQPGAENAFKWLSIRTALGDKAYAWDTNEEYLFKAMVAFSMRKVP 
NREATE I SHVLLCNVTQRVS FWF WTDP S KNHTL P AVE VQ S A I RMNKNR I NN AF FLNDQTLE 
FLKIPSTLAPPMDPSVPIWI I IFGVIFCI I IVAIALLI LSGIWQRRRKNKEPSEVDDAEDKC 
ENMITIENGIPSDPLDMKGGILMMPS 
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AACTCAAACTCCTCTCTCTGGGAAAACGCGGTGCTTGCTCCTCCCGGAGTGGCCTTGGCAGG 
GTGTTGGAGCCCTCGGTCTGCCCCGTCCGGTCTCTGGGGCCAAGGCTGGGTTTCCCTCATGT 
ATGGCAAGAGCTCTACTCGTGCGGTGCTTCTTCTCCTTGGCATACAGCTCACAGCTCTTTGG 
CCTATAGCAGCTGTGGAAATTTATACCTCCCGGGTGCTGGAGGCTGTTAATGGGACAGATGC 
TCGGTTAAAATGCACTTTCTCCAGCTTTGCCCCTGTGGGTGATGCTCTAACAGTGACCTGGA 
ATTTTCGTCCTCTAGACGGGGGACCTGAGCAGTTTGTATTCTACTACCACATAGATCCCTTC 
CAACCCATGAGTGGGCGGTTTAAGGACCGGGTGTCTTGGGATGGGAATCCTGAGCGGTACGA 
TGCCTCCATCCTTCTCTGGAAACTGCAGTTCGACGACAATGGGACATACACCTGCCAGGTGA 
AGAACCCACCTGATGTTGATGGGGTGATAGGGGAGATCCGGCTCAGCGTCGTGCACACTGTA 
CGCTTCTCTGAGATCCACTTCCTGGCTCTGGCCATTGGCTCTGCCTGTGCACTGATGATCAT 
AATAGTAATTGTAGTGGTCCTCTTCCAGCATTACCGGAAAAAGCGATGGGCCGAAAGAGCTC 
AT AAAGTGGTGGAGATAAAATCAAAAGAAGAGGAAAGG CT CAAC CAAG AG AAAAAGGT CT CT 
GTTTATTTAGAAGACACAGACTAACAATTTTAGATGGAAGCTGAGATGATTTCCAAGAACAA 
GAACCCTAGTATTTCTTGAAGTTAATGGAAACTTTTCTTTGGCTTTTCCAGTTGTGACCCGT 
TTTCCAACCAGTTCTGCAGCATATTAGATTCTAGACAAGCAACACCCCTCTGGAGCCAGCAC 
AGTGCTCCTCCATATCACCAGTCATACACAGCCTCATTATTAAGGTCTTATTTAATTTCAGA 
GTGTAAATTTTTTCAAGTGCTCATTAGGTTTTATAAACAAGAAGCTACATTTTTGCCCTTAA 
GACACTACTTACAGTGTTATGACTTGTATACACATATATTGGTATCAAAGGGGATAAAAGCC 
AATTTGTCTGTTACATTTCCTTTCACGTATTTCTTTTAGCAGCACTTCTGCTACTAAAGTTA 
ATGTGTTTACTCTCTTTCCTTCCCACATTCTCAATTAAAAGGTGAGCTAAGCCTCCTCGGTG 
TTTCTGATTAACAGTAAATCCTAAATTCAAACTGTTAAATGACATTTTTATTTTTATGTCTC 
TCCTTAACTATGAGACACATCTTGTTTTACTGAATTTCTTTCAATATTCCAGGTGATAGATT 
TTTGTCG 
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AHKWE I KS KEEERLNQEKKVS VYLEDTD 
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FIGURE 2S1 

GCATTTTTGTCTGTGCTCCCTGATCTTCAGGTCACCACCATGAAGTTCTTAGCAGTCCTGGT 
ACTCTTGGGAGTTTCCATCTTTCTGGTCTCTGCCCAGAATCCGACAACAGCTGCTCCAGCTG 
ACACGTATCCAGCTACTGGTCCTGCTGATGATGAAGCCCCTGATGCTGAAACCACTGCTGCT 
GCAACCACTGCGACCACTGCTGCTCCTACCACTGCAACCACCGCTGCTTCTACCACTGCTCG 
TAAAGACATTCCAGTTTTACCCAAATGGGTTGGGGATCTCCCGAATGGTAGAGTGTGTCCCT 
GAGATGGAATCAGCTTGAGTCTTCTGCAATTGGTCACAACTATTCATGCTTCCTGTGATTTC 
ATCCAACTACTTACCTTGCCTACGATATCCCCTTTATCTCTAATCAGTTTATTTTCTTTCAA 
ATAAAAAATAACTATGAGCAACATAAAAAAT A AAAAA 
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FIGURE 282 

MKFLAVLVLLGVS I FLVSAQNPTTAAPADTYP ATGPADDEAPDAETTAAATTATTAAPTTAT 
TAASTTARKDI PVLPKWVGDLPNGRVCP 
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FIGURE 283 

GGACTCTGAAGGTCCCAAGCAGCTGCTGAGGCCCCCAAGGAAGTGGTTCCAACCTTGGACCC 
. m CTAGGGGTCTGGATTTGCTGGTTAACAAGATAACCTGAGGGCAGGACCCCATAGGGGAATGC 
. m TACCTCCTGCCCTTCCACCTGCCCTGGTGTTCACGGTGGCCTGGTCCCTCCTTGCCGAGAGA 

GTGTCCTGGGTCAGGGACGCAGAGGACGCTCACAGACTCCAGCCCTTTGTTACCGAGAGGAC 

if ACTTGGCAAGGTCCAGCGATGGTCCGGAGTCCACACACAGACTGGCGGCAGGGCAGGAGGGG 

1 

GACAGTTCTGTTGTGCTTGGTTGGACAGTAAGAGGGTCTTGGCCAGTCCAGGGTGGGGGGCG 
GCAAACTCCATAAAGAACCAGAGGGTCTGGGGCCCGGCCACAGAGTCATCTGCCCAGCTCCT 
CTGCTGCTGGCCAGTGGGAGTGGCACGAGGTGGGGCTTTGTGCCAGTAAAACCACAGGCTGG 
. ATTTGCCTGCGGGCCATGGTCCCTGTCTAGGGCAGCAATTCTCAACCTTGTTGCTCTCAGGA 
CCCCAAAGAGCTTTCATTGTATCTATTGATTTTTACCACATTAGCAATTAAAACTGAGAAAT 
GGGCCGGGCACGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGAT 
CACCTGAGATCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCTTGTCTACTAAAAA 
§ TACAAAAAATTAGCCAGGCACAGTGGTGTGCACTGGTAGTCCCAGTTACTCGGGAGGCTGAG 
^ GCAGGAAAATCGCTTGAACCCAGGAGGCGGACGTTGCGGTGAGCCGAGATCGCGCCGCTGAT 
TCCAGCCTGGGCGACAAGAGTGAGACTCCATCTCACACA 
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ML P P AIjP PALVFTVAWS LLAERVS WVRDAE DAHRLQ P F VTERTLGKVQ RW SG VHTQTGGR AG 
GGQ FC CAWLD S KRVLAS PGWGAANS I KNQRWAPATESS AQLLCCWPVGVARGGALCQ 
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GTCATGCCAGTGCCTGCTCTGTGCCTGCTCTGGGCCCTGGCAATGGTGACCCGGCCTGCCTCA 
GCGGCCCCCATGGGCGGCCCAGAACTGGCACAGCATGAGGAGCTGACCCTGCTCTTCCATGG 
GACCCTGCAGCTGGGCCAGGCCCTCAACGGTGTGTACAGGACCACGGAGGGACGGCTGACAA 
AGGCCAGGAACAGCCTGGGTCTCTATGGCCGCACAATAGAACTCCTGGGGCAGGAGGTCAGC 
1 CGGGGCCGGGATGCAGCCCAGGAACTTCGGGCAAGCCTGTTGGAGACTCAGATGGAGGAGGA 

i 

TATTCTGCAGCTGCAGGCAGAGGCCACAGCTGAGGTGCTGGGGGAGGTGGCCCAGGCACAGA 
AGGTGCTACGGGACAGCGTGCAGCGGCTAGAAGTCCAGCTGAGGAGCGCCTGGCTGGGCCCT 
GCCTACCGAGAATTTGAGGTCTTAAAGGCTCACGCTGACAAGCAGAGCCACATCCTATGGGC 
CCTCACAGGCCACGTGCAGCGGCAGAGGCGGGAGATGGTGGCACAGCAGCATCGGCTGCGAC 
AGATCCAGGAGAGACTCCACACAGCGGCGCTCCCAGCCTGAATCTGCCTGGATGGAACTGAG 
GACCAATCATGCTGCAAGGAACACTTCCACGCCCCGTGAGGCCCCTGTGCAGGGAGGAGCTG 
CCTGTTCACTGGGATCAGCCAGGGCGCCGGGCCCCACTTCTGAGCACAGAGCAGAGACAGAC 
|f GCAGGCGGGGACAAAGGCAGAGGATGTAGCCCCATTGGGGAGGGGTGGAGGAAGGACATGTA 
V -' C C C TTT C ATG C C T AC AC AC C C CT C ATT AAAG C AG AG T C G T GG C ATT T C AAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 286 

MPVPALCLLWALAIWTRPASAAPMGGPELAQHEELTLLFHGTLQLGQALNGVyRTTEGRLTK 
ARNSLGLYGRTIELLGQEVSRGRDAAQELRASLLETQMEEDILQLQAEATAEVLGEVAQAQK 
VLRDSVQRLEVQLRSAWLGPAYREFEVLKAHADKQSHILWALTGHVQRQRREMVAQQHRLRQ 
IQERLHTAALPA 
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FIGURE 287 



, GGCAACATSGCTCAGCAGGCTTGCCCCAGAGCCATGGCAAAGAATGGACTTGTAATTTGCAT 
CCTGGTGATCACCTTACTCCTGGACCAGACCACCAGCCACACATCCAGATTAAAAGCCAGGA 
AGCACAGCAAACGTCGAGTGAGAGACAAGGATGGAGATCTGAAGACTCAAATTGAAAAGCTC 
TGGACAGAAGTCAATGCCTTGAAGGAAATTCAAGCCCTGCAGACAGTCTGTCTCCGAGGCAC 

I TAAAGTTCACAAGAAATGCTACCTTGCTTCAGAAGGTTTGAAGCATTTCCATGAGGCCAATG 
AAGACTGCATTTCCAAAGGAGGAATCCTGGTTATCCCCAGGAACTCCGACGAAATCAACGCC 
CTCCAAGACTATGGTAAAAGGAGCCTGGCAGGTGTCAATGACTTTTGGCTGGGCATCAATGA 
CATGGTCACGGAAGGCAAGTTTGTTGACGTCAACGGAATCGCTATCTCCTTCCTCAACTGGG 
ACCGTGCACAGCCTAACGGTGGCAAGCGAGAAAACTGTGTCCTGTTCTCCCAATCAGCTCAG 
GGCAAGTGGAGTGATGAGGCCTGTCGCAGCAGCAAGAGATACATATGCGAGTTCACCATCCC 
TAA ATAG GTCTTTCTCCAATGTGTCCTCCAAGCAAGATTCATCATAACTTATAGGTTCATGA 
TCTCTAAGATCAAGTAAAAATCATAATTTTTACTTATT7VAAAAATTGCAACACAAGATCAAT 

3 GTCCATAGCAATATGATAGCATCAGCCAATTTTGCTAACACATTTCTTTGGGATTTTGCCCT 

A TCCTGGGGTATAGGGGATCAGAAATATTGATCCATGTGCACGCAGATAAAATGGCTTCTGCT 
AAACAGACTAAAATCTTTCTCTCTAGTCTTTCTCACTTGTACAAACCCAGTTTGTTTTCAAA 
AAATCACAGTAGCAATGCAACTCATCACTCTAGAAAAGCAAGCTTAGGCTACCTGAAAGATT 
TTCCCTTGGAAGTTTAGCGTATGTTTGACTAACAAAAATTCCCTACATCAGAGACTCTAGGT 
GCTATATAATCCAAAAACTTTTCAGCCTGTTGCTCATTCTGTCCCATGCTGGCAATAATACC 
TTGTCAGCCCATTACCCTTATTTTGAATTGCTCCATCTCCTGGTGGGACTTGTATCTTGTCT 
GCCATATCAGAACACAAACCCCTGAAGAGGTTCTGATTTGATTTTTTTTTTTTCTTCATGCC 
TACCCTTTTTTTGGAAGTTTCCAGCCGCAATTTGAAATGAAATGACAAGGTGTATATTTGAT 
CAATTTTCATTCCCACCATTGCATTACAACCTCTAACTTAAATGGGTAACCCTAAGGCATAT 

m CAAAGAAG CAGATT Z CATGATAAACGGAAATAGAAAAAAAGAACCTACATTTATTTTGCTTT 

AGCATCCTTACTCTCACCTTTTATGAGATTGAGAGTGGACTTACATTTCCTTTTTTACATTT 
, TCGTATATTTATTTTTTTTAGCCATCATTATATGTTTAAGTCTATTATGGGCAACCAATCTT 
TGGAAGCTGAAAACTGAATTTAAAGAATGCTATCTTGGAAAATTGCATACGTCTGTGCAATT 
TTTTATTCTGCCTAGTGCTATTCTGCTTGTTTAACTAGATTGTACAAAATAACTTCATTGCT 
TAATATCAAATTACAAAGTTTAGACTTGGAGGGAAATGGGCTTTTTAGAAGCAAACAATTTT 
AAATATATTTTGTTCTTCAAATAAATAGTGTTTAAACATTGAATGTGTTTTGTGAACAATAT 
CCCACTTTGCAAACTTTAACTACACATGCTTGGAATTAAGTTTTAGCTGTTTTCATTGCTCA 
ATAATAAAGCCTGAATTCTGATCAATAAAAAAAAAAAAAAAAAAAAAAAA 



BNSDOCfD: < WO 9963088 A2 _ I 



WO 99/63088 ^C^/^V O PCT/US99/12252 

FIGURE 288 

MAQQACPRAMAKNGLVICILVITLLLDQTTSHTSRLKARKHSKRRVRDKDGDLKTQIEKLWT 
EVNALKE I QALQTVCLRGTKVHKKCYLASEGLKHPHEANEDC I S KGG I LV I PRNSDE INALQ 
DYGKRSLPGVNDFWLGINDMVTEGKFVDVNGIAISFLNWDRAQPNGGKRENCVLFSQSAQGK 
WSDEACRSSKRYICEFTIPK 
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GCGAGGACCGGGTATAAGAAGCCTCGTGGCCTTGCCCGGGCAGCCGCAGGTTCCCCGCGCGC 
CCCGAGCCCCCGCGCCATGAAGCTCGCCGCCCTCCTGGGGCTCTGCGTGGCCCTGTCCTGCA 
GCTCCGCTGCTGCTTTCTTAGTGGGCTCGGCCAAGCCTGTGGCCCAGCCTGTCGCTGCGCTG 
GAGTCGGCGGCGGAGGCCGGGGCCGGGACCCTGGCCAACCCCCTCGGCACCCTCAACCCGCT 
| GAAGCTCCTGCTGAGCAGCCTGGGCATCCCCGTGAACCACCTCATAGAGGGCTCCCAGAAGT 
1 GTGTGGCTGAGCTGGGTCCCCAGGCCGTGGGGGCCGTGAAGGCCCTGAAGGCCCTGCTGGGG 
GCCCTGACAGTGTTTGGCTgAGCCGAGACTGGAGCATCTACACCTGAGGACAAGACGCTGCC 
CACCCGCGAGGGCTGAAAACCCCGCCGCGGGGAGGACCGTCCATCCCCTTCCCCCGGCCCCT 
CTCAATAAACGTGGTTAAGAGCAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAA 
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MKLAAliLtGLCVAIiSCSSAAAFLVGSAKPVAQPVAALESAAEAGAGTLANPLGTLNPLKLLLS 
SLGIPVNHLIEGSQKCVAELGPQAVGAVKALKALLGALTVFG 
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TGAAGGACTTTTCCAGGACCCAAGGCCACACACTGGAAGTCTTGCAGCTGAAGGGAGGCACT 
CCTTGGCCTCCGCAGCCGATCACATGAAGGTGGTGCCAAGTCTCCTGCTCTCCGTCCTCCTG 
GCACAGGTGTGGCTGGTACCCGGCTT^CCCCCAGTCCTCAGTCGCCAGAGACCCCAGCCCC 
TCAGAACCAGACCAGCAGGGTAGTGCAGGCTCCCAGGGAGGAAGAGGAAGATGAGCAGGAGG 
C C AGCGAGG AG AAGGCCGGT GAGG AAGAGAAAG C CT GG CT GATGGC C AGC AGG C AG C AGCTT 
GCCAAGGAGACTTCAAACTTCGGATTCAGCCTGCTGCGAAAGATCTCCATGAGGCACGATGG 
CAACATGGTCTTCTCTCCATTTGGCATGTCCTTGGCCATGACAGGCTTGATGCTGGGGGCCA 
CAGGGCCGACTGAAACCCAGATCAAGAGAGGGCTCCACTTGCAGGCCCTGAAGCCCACCAAG 
CCCGGGCTCCTGCCTTCCCTCTTTAAGGGACTCAGAGAGACCCTCTCCCGCAACCTGGAACT 
GGGCCTCTCACAGGGGAGTTTTGCCTTCATCCACAAGGATTTTGATGTCAAAGAGACTTTCT 
TCAATTTATCCAAGAGGTATTTTGATACAGAGTGCGTGCCTATGAATTTTCGCAATGCCTCA 
CAGGCCAAAAGGCTCATGAATCATTACATTAACAAAGAGACTCGGGGGAAAATTCCCAAACT 
GTTTGATGAGATTAATCCTGAAACCAAATTAATTCTTGTGGATTACATCTTGTTCAAAGGGA 
AATGGTTGACCCCATTTGACCCTGTCTTCACCGAAGTCGACACTTTCCACCTGGACAAGTAC 
AAGACCATTAAGGTGCCCATGATGTACGGTGCAGGCAAGTTTGCCTCCACCTTTGACAAGAA 
TTTTCGTTGTCATGTCCTCAAACTGCCCTACCAAGGAAATGCCACCATGCTGGTGGTCCTCA 
TGGAGAAAAT GGGTG AC CACCTCGCCCTTGAAG ACT A CCTGACC AC AG ACTTGGTGG AG AC A 
TGGCTCAGAAACATGAAAACCAGAAACATGGAAGTTTTCTTTCCGAAGTTCAAGCTAGATCA 
GAAGTATGAGATGCATGAGCTGCTTAGGCAGATGGGAATCAGAAGAATCTTCTCACCCTTTG 
CTGACCTTAGTGAACTCTCAGCTACTGGAAGAAATCTCCAAGTATCCAGGGTTTTACGAAGA 
ACAGTGATTGAAGTTGATGAAAGGGGCACTGAGGCAGTGGCAGGAATCTTGTCAGAAATTAC 
TGCTTATTCCATGCCTCCTGTCATCAAAGTGGACCGGCCATTTCATTTCATGATCTATGAAG 
AAACCTCTGGAATGCTTCTGTTTCTGGGCAGGGTGGTGAATCCGACTCTCCTA TAA TTCAGG 
ACATGCATAAGCACTTCGTGCTGTAGTAGATGCTGAATCTGAGGTATCAAACACACACAGGA 
TACCAGCAATGGATGGCAGGGGAGAGTGTTCCTTTTGTTCTTAACTAGTTTAGGGTGTTCTC 
AAATAAATACAGTAGTCCCCACTTATCTGAGGGGGATACATTCAAAGACCCCCAGCAGATGC 
CTGAAACGGTGGACAGTGCTGAACCTTATATATATTTTTTCCTACACATACATACCTATGAT 
AAAGTTTAATTTATAAATTAGGCACAGTAAGAGATTAACAATAATAACAACATTAAGTAAAA 
TGAGTTACTTGAACGCAAGCACTGCAATACCATAACAGTCAAACTGATTATAGAGAAGGCTA 
CTAAGTGACTCATGGGCGAGGAGCATAGACAGTGTGGAGACATTGGGCAAGGGGAGAATTCA 
CATCCTGGGTGGGACAGAGCAGGACGATGCAAGATTCCATCCCACTACTCAGAATGGCATGC 
TGCTTAAGACTTTTAGATTGTTTATTTCTGGAATTTTTCATTTAATGTTTTTGGACCATGGT 
TGACCATGGTTAACTGAGACTGCAGAAAGCAAAACCATGGATAAGGGAGGACTACTACAAAA 
GCATTAAATTGATACATATTTTTTAAAAAAAAAAAAAAAAAAA 
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MKWPSLLLSVLLAQVWLVPGLAPSPQSPETPAPQNQTSRWQAPREEEEDEQEASEEKAGE 
EEKAWLI^SRQQLAKETSNFGFSLLRKISMRH^ 

KRGLHLQALKPTKPGLLPSLFKGLRETLSRNLELGLSQGSFAFIHKDFDVKETFFNLSKRYF 

DTECVPMNFRNASQAKRLMNHYINKETRGKIPKLFDEINPETKLILVDYILFKGKWLTPFDP 
; y VFTEVDTFHLDKYKTIKVPMMYGAGKFASTFDKNFRCHVLKLPYQGNATMLWLMEKMGDHL 
M ALEDYLTTDLVETWLR1MKTRNMEVFFPICFKLDQKYEMHELLRQMGIRRIFSPFADLSELSA 

TGRNLQVSRVLRRTVIEVDERGTEAVAGILSEITAYSMPPVIKVDRPFHFMIYEETSGMLLF 

LGRWNPTLL 
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FIGURE 293 

CTGGGATCAGCCACTGCAGCTCCCTGAGCACTCTCTACAGAGACGCGGAGCCCAGACATGAG 
GAGGCTCCTCCTGGTCACCAGCCTGGTGGTTGTGCTGCTGTGGGAGGCAGGTGCAGTCCCAG 
^ CACCCAAGGTCCCTATCAAGATGCAAGTCAAACACTGGCCCTCAGAGCAGGACCCAGAGAAG 
GCCTGGGGCGCCCGTGTGGTGGAGCCTCCGGAGAAGGACGACCAGCTGGTGGTGCTGTTCCC 
| TGTCCAGAAGCCGAAACTCTTGACCACCGAGGAGAAGCCACGAGGTCAGGGCAGGGGCCCCA 
^ TCCTTCCAGGCACCAAGGCCTGGATGGAGACCGAGGACACCCTGGGCCGTGTCCTGAGTCCC 
GAGCCCGACCATGACAGCCTGTACCACCCTCCGCCTGAGGAGGACCAGGGCGAGGAGAGGCC 
CCGGTTGTGGGTGATGCCAAATCACCAGGTGCTCCTGGGACCGGAGGAAGACCAAGACCACA 
TCTACCACCCCCAGTAGGGCTCCAGGGGCCATCACTGCCCCCGCCCTGTCCCAAGGCCCAGG 
CTGTTGGGACTGGGACCCTCCCTACCCTGCCCCAGCTAGACAAATAAACCCCAGCAGGCAAA 
AAAAAAAAAAAAAAAA 
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FIGURE 294 

MRRLLLVTSLWVLLWEAGAVPAPKVP I KMQVKHWP SEQD PE KAWGARVVEPPEKDDQLVVL 
FPVQKPKLLTTEEKPRGQGRGP ILPGTKAWMETEDTLGRVLS PE PDHDSL YHPPPEEDQGEE 
RPRLWVMPNHQ VKLGPEEDQDH I YHPQ 
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AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTG 
TACCCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATGAACCAACTCAGCTTCCTGC 
TGTTTCT CAT AG CG AC C AC C AG AG GATGGAGTACAGATGAGGCTAAT AC TTACTTCAAGGAA 
TGGACCTGTTCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCC 
■i TAGTGCATTTGATGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCT 
| GTGACATGACCTCTGGGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATG 
CGTGGGAAGTGCACGGTGGGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCC 
AGAGGGGGACGGCAACTGGGCCAACTACAACACCTTTGGATCTGCAGAGGCGGCCACGAGCG 
ATGACTACAAGAACCCTGGCTACTACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTG 
CCCAATAAGTCCCCCATGCAGCACTGGAGAAACAGCTCCCTGCTGAGGTACCGCACGGACAC 
TGGCTTCCTCCAGACACTGGGACATAATCTGTTTGGCATCTACCAGAAATATCCAGTGAAAT 
ATGGAG7VAGGAAAGTGTTGGACTGACAACGGCCCGGTGATCCCTGTGGTCTATGATTTTGGC 
GACGCCCAGAAAACAGCATCTTATTACTCACCCTATGGCCAGCGGGAATTCACTGCGGGATT 
TGTTCAGTTCAGGGTATTTAATAACGAGAGAGCAGCCAACGCCTTGTGTGCTGGAATGAGGG 

•g 

§ TCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAGGATACTTTCCAGAGGCCAGT 
CCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATATGGAACTCATGTTGGTTA 
CAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCGT TGA GAGTTTTGTG 
GGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAACAACTTACCCA 
GTAGCTAGAATGTTAATGGCAGAAGAGAAAACAATAAATCATATTGACTCAAGAAAAAAA 
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MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTEN 
GVIYQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGISrWANYNTFG 
SAEAATSDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGI 
YQKYPVKYGEGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAAN 
ALCAGMRVTGCNTEHHCIGGGGYFPELASPQQCGDFSGFDWSGYGTET^GYSSSREITEAAVLLFYR 




BNSDOCID: <WO .9963088 A2. 1 _> 



WO 99/63088 ^ , / PCT/US99/12252 

;* 3o-f/3vo 

FIGURE 297 

GCGGAGCCGGCGCCGGCTGCGCAGAGGAGCCGCTCTCGCCGCCGCCACCTCGGCTGGGAGCC 
CACGAGGCTGCCGCATCCTGCCCTCGGAACAATGGGACTCGGCGCGCGAGGTGCTTGGGCCG 
CGCTGCTCCTGGGGACGCTGCAGGTGCTAGCGCTGCTGGGGGCCGCCCATGAAAGCGCAGCC 
ATGGCGGCATCTGCAAACATAGAGAATTCTGGGCTTCCACACAACTCCAGTGCTAACTCAAC 

I AGAGACTCTCCAACATGTGCCTTCTGACCATACAAATGAAACTTCCAACAGTACTGTGAAAC 

1 

n CACCAACTTCAGTTGCCTCAGACTCCAGTAATACAACGGTCACCACCATGAAACCTACAGCG 
GCATCTAATACAACAACACCAGGGATGGTCTCAACAAATATGACTTCTACCACCTTAAAGTC 

4 TACACCCAAAACAACAAGTGTTTCACAGAACACATCTCAGATATCAACATCCACAATGACCG 
TAACCCACAATAGTTCAGTGACATCTGCTGCTTCATCAGTAACAATCACAACAACTATGCAT 
TCTGAAGCAAAGAAAGGATCAAAATTTGATACTGGGAGCTTTGTTGGTGGTATTGTATTAAC 
GCTGGGAGTTTTATCTATTCTTTACATTGGATGCAAAATGTATTACTCAAGAAGAGGCATTC 
GGTATCGAACCATAGATGAACATGATGCCATCATTTAAGGAAATCCATGGACCAAGGATGGA 

m ATACAGATTGATGCTGCCCTATCAATTAATTTTGGTTTATTAATAGTTTAAAACAATATTCT 

^ CTTTTTGAAAATAGTATAAACAGGCCATGCATATAATGTACAGTGTATTACGTAAATATGTA 
AAGATTCTTCAAGGTAACAAGGGTTTGGGTTTTGAAATAAACATCTGGATCTTATAGACCGT 
TCATACAATGGTTTTAGCAAGTTCATAGTAAGACAAACAAGTCCTATCTTTTTTTTTTGGCT 
GGGGTGGGGGCATTGGTCACATATGACCAGTAATTGAAAGACGTCATCACTGAAAGACAGAA 
TGCCATCTGGGCATACAAATAAGAAGTTTGTCACAGCACTCAGGATTTTGGGTATCTTTTGT 
AGCTCACATAAAGAACTTCAGTGCTTTTCAGAGCTGGATATATCTTAATTACTAATGCCACA 
CAGAAATTATACAATCAAACTAGATCTGAAGCATAATTTAAGAAAAACATCAACATTTTTTG 
TGCTTTAAACTGTAGTAGTTGGTCTAGAAACAAAATACTCC 
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FIGURE 298 
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TNETSNSTVKPPTSVASDSSNTTVTTMKPTAASNTTTPGMVSTNMTSTTLKSTPKTTSVSQN 
TSQISTSTMTVTHNSSVTSAASSVTITTTMHSEAKKGSKFDTGSFVGGIVLTLGVLSILYIG 
CKMY YSRRG I R YRT I DEHD A I I 
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FIGURE 299 

CAGCCGGGTCCCAAGCCTGTGCCTGAGCCTGAGCCTGAGCCTGAGCCCGAGCCGGGAGCCGG 
TCGCGGGGGCTCCGGGCTGTGGGACCGCTGGGCCCCCAGCGATGGCGACCCTGTGGGGAGGC 
CTTCTTCGGCTTGGCTCCTTGCTCAGCCTGTCGTGCCTGGCGCTTTCCGTGCTGCTGCTGGC 
GCAGCTGTCAGACGCCGCCAAGAATTTCGAGGATGTCAGATGTAAATGTATCTGCCCTCCCT 
ATAAAGAAAATTCTGGGCATATTTATAATAAGAACATATCTCAGAAAGATTGTGATTGCCTT 
CATGTTGTGGAGCCCATGCCTGTGCGGGGGCCTGATGTAGAAGCATACTGTCTACGCTGTGA 
ATGCAAATATGAAGAAAGAAGCTCTGTCACAATCAAGGTTACeATTATAATTTATCTCTCCA 
TTTTGGGCCTTCTACTTCTGTACATGGTATATCTTACTCTGGTTGAGCCCATACTGAAGAGG 
CGCCTCTTTGGACATGCACAGTTGATACAGAGTGATGATGATATTGGGGATCACCAGCCTTT 
TGCAAATGCACACGATGTGCTAGCCCGCTCCCGCAGTCGAGCCAACGTGCTGAACAAGGTAG 
AATATGCACAGCAGCGCTGGAAGCTTCAAGTCCAAGAGCAGCGAAAGTCTGTCTTTGACCGG 
CATGTTGTCCTCAGCTAATTGGGAATTGAATTCAAGGTGACTAGAAAGAAACAGGCAGACAA 
CTGGAAAGAACTGACTGGGTTTTGCTGGGTTTCATTTTAATACCTTGTTGATTTCACCAACT 
GTTGCTGGAAGATTCAAAACTGGAAGCAAAAACTTGCTTGATTTTTTTTTCTTGTTAACGTA 
ATAATAGAGACATTTTTAAAAGCACACAGCTCAAAGTCAGCCAATAAGTCTTTTCCTATTTG 
TGACTTTTACTAATAAAAATAAATCTGCCTGTAAATTATCTTGAAGTCCTTTACCTGGAACA 
AGCACTCTCTTTTTCACCACATAGTTTTAACTTGACTTTCAAGATAATTTTCAGGGTTTTTG 
TTGTTGTTGTTTTTTGTTTGTTTGTTTTGGTGGGAGAGGGGAGGGATGCCTGGGAAGTGGTT 
AACAACTTTTTTCAAGTCACTTTACTAAACAAACTTTTGTAAATAGACCTTACCTTCTATTT 
TCGAGTTTCATTTATATTTTGCAGTGTAGCCAGCCTCATCAAAGAGCTGACTTACTCATTTG 
ACTTTTGCACTGACTGTATTATCTGGGTATCTGCTGTGTCTGCACTTCATGGTAAACGGGAT 
CTAAAATGCCTGGTGGCTTTTCACAAAAAGCAGATTTTCTTCATGTACTGTGATGTCTGATG 
CAATGCATCCTAGAACAAACTGGCCATTTGCTAGTTTACTCTAAAGACTAAACATAGTCTTG 
GTGTGTGTGGTCTTACTCATCTTCTAGTACCTTTAAGGACAAATCCTAAGGACTTGGACACT 
TGCAATAAAGAAATTTTATTTTAAACCCAAGCCTCCCTGGATTGATAATATATACACATTTG 
TCAGCATTTCCGGTCGTGGTGAGAGGCAGCTGTTTGAGCTCCAATATGTGCAGCTTTGAACT 
AGGGCTGGGGTTGTGGGTGCCTCTTCTGAAAGGTCTAACCATTATTGGATAACTGGCTTTTT 
TCTTCCTATGTCCTCTTTGGAATGTAACAATAAAAATAATTTTTGAAACATCAA 
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GCACCTGCGACCACCGTGAGCAGTCATGGCGTACTCCACAGTGCAGAGAGTCGCTCTGGCTT 
CTGGGCTTGTCCTGGCTCTGTCGCTGCTGCTGCCCAAGGCCTTCCTGTCCCGCGGGAAGCGG 
CAGGAGCCGCCGCCGACACCTGAAGGAAAATTGGGCCGATTTCCACCTATGATGCATCATCA 
CCAGGCACCCTCAGATGGCCAGACTCCTGGGGCTCGTTTCCAGAGGTCTCACCTTGCCGAGG 
4 CATTTGCAAAGGCCAAAGGATCAGGTGGAGGTGCTGGAGGAGGAGGTAGTGGAAGAGGTCTG 
j| ATGGGGCAGATTATTCCAATCTACGGTTTTGGGATTTTTTTATATATACTGTACATTCTATT 
TAAGGTAAGTAGAATCATCCTAATCATATTACATCAATGAAAATCTAATATGGCGATAAAAA 
TCATTGTCTACATTAAAACTTCTTATAGTTCATAAAATTATTTCAAATCCATCATCTCTTTA 
AATCCTGCCTCCTCTTCATGAGGTACTTAGGATAGCCATTATTTCAGTTTCACATAAGAATG 
TTTACTCAATGTTTAAGTGTTTTGCCCCAAAATTCACAACTAACAAGGCAGAACTAGGACTT 
GAACATGGATCTTTTGGTTCTTAATCCAGTGAGTGATACAATTCAATGCACTCCCCTGCCA 
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FIGURE 303 



CGGCTCGAGTGCAGCTGTGGGGAGATTTCAGTGCATTGCCTCCCCTGGGTGCTCTTCATCTT 
GGATTTGAAAGTTGAGAGCAGCATGTTTTGCCCACTGAAACTCATCCTGCTGCCAGTGTTAC 
TGGATTATTCCTTGGGCCTGAATGACTTGAATGTTTCCCCGCCTGAGCTAACAGTCCATGTG 
GGTGATTCAGCTCTGATGGGATGTGTTTTCCAGAGCACAGAAGACAAATGTATATTCAAGAT 
AG ACTGGACTCTGTC AC C AGGAGAGCACGCCAAGG A CG AATATGTGCTAT ACT ATT ACTCCA 
ATCTCAGTGTGCCTATTGGGCGCTTCCAGAACCGCGTACACTTGATGGGGGACATCTTATGC 
AATGATGGCTCTCTCCTGCTCCAAGATGTGCAAGAGGCTGACCAGGGAACCTATATCTGTGA 
AATrCGCCTCAAAGGGGAGAGCCAGGTGTTCAAGAAGGCGGTGGTACTGCATGTGCTTCCAG 
AGGAGCCCAAAGAGCTCATGGTCCATGTGGGTGGATTGATTCAGATGGGATGTGTTTTCCAG 
AGCACAGAAGTGAAACACGTGACCAAGGTAGAATGGATATTTTCAGGACGGCGCGCAAAGGA 
GGAGATTGTATTTCGTTACTACCACAAACTCAGGATGTCTGTGGAGTACTCCCAGAGCTGGG 
GCCACTTCCAGAATCGTGTGAACCTGGTGGGGGACATTTTCCGCAATGACGGTTCCATCATG 
CTTCAAGGAGTGAGGGAGTCAGATGGAGGAAACTACACCTGCAGTATCCACCTAGGGAACCT 
GGTGTTCAAGAAAACCATTGTGCTGCATGTCAGCCCGGAAGAGCCTCGAACACTGGTGACCC 
CGGCAGCCCTGAGGCCTCTGGTCTTGGGTGGTAATCAGTTGGTGATCATTGTGGGAATTGTC 
TGTGCCACAATCCTGCTGCTCCCTGTTCTGATATTGATCGTGAAGAAGACCTGTGGAAATAA 
GAGTTCAGTGAATTCTACAGTCTTGGTGAAGAACACGAAGAAGACTAATCCAGAGATAAAAG 
AAAAACCCTGCCATTTTGAAAGATGTGAAGGGGAGAAACACATTTACTCCCCAATAATTGTA 
CGGGAGGTGATCGAGGAAGAAGAACCAAGTGAAAAATCAGAGGCCACCTACATGACCATGCA 
CCCAGTTTGGCCTTCTCTGAGGTCAGATCGGAACAACTCACTTGAAAAAAAGTCAGGTGGGG 
GAAT GC C AAAAAC AC AG CAAGCCTTTTQAGAAGAATGG AG AG TCCCTT CATC TC AG C AG CGG 
TGGAGACTCTCTCCTGTGTGTGTCCTGGGCCACTCTACCAGTGATTTCAGACTCCCGCTCTC 
CCAGCTGTCCTCCTGTCTCATTGTTTGGTCAATACACTGAAGATGGAGAATTTGGAGCCTGG 
CAGAGAGACTGGACAGCTCTGGAGGAACAGGCCTGCTGAGGGGAGGGGAGCATGGACTTGGC 
CTCTGGAGTGGGACACTGGCCCTGGGAACCAGGCTGAGCTGAGTGGCCTCAAACCCCCCGTT 
GGATCAGACCCTCcTGTGGGCAGGGTTCTTAGTGGATGAGTTACTGGGAAGAATCAGAGATA 
AAAACCAACCCAAATCAA 
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MFCPLKLILLPVLLDYSLGLNDLWSPPELTVHVGDSALMGCVFQSTEDKCIFKlDWTIiSPG 
EHAKDE YVLY YYSNLiS VP I GRFQNRVHLMGD I LCNDGS LLLQDVQEADQGT Y I CE I RLKGES 
QVFKKAWLHVL PEE P KELMVHVGGL I QMGCVFQ S TE VKHVTKVE W I F S GRRAKEE I VFRY Y 
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M LVKNTKKTNPEIKEKPCHFERCEGEKHIYSPI IVREVIEEEEPSEKSEATYMTMHPVWPSLR 
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FIGURE 305 

CTATGAAGAAGCTTCCTGGAAAACAATAAGCAAAGGAAAACAAATGTGTCCCATCTCACATG 
GTTCTACCCTACTAAAGACAGGAAGATCATAAACTGACAGATACTGAAATTGTAAGAGTTGG 
AAACTACATTTTGCAAAGTCATTGAACTCTGAGCTCAGTTGCAGTACTCGGGAAGCCATgCA 
GGATGAAGATGGATACATCACCTTAAATATTAAAACTCGGAAACCAGCTCTCGTCTCCGTTG 
GCCCTGCATCCTCCTCCTGGTGGCGTGTGATGGCTTTGATTCTGCTGATCCTGTGCGTGGGG 
ATGGTTGTCGGGCTGGTGGCTCTGGGGATTTGGTCTGTCATGCAGCGCAATTACCTACAAGA 
TGAGAATGAAAATCGCACAGGAACTCTGCAACAATTAGCAAAGCGCTTCTGTCAATATGTGG 
TAAAACAATCAGAACTAAAGGGCACTTTCAAAGGTCATAAATGCAGCCCCTGTGACACAAAC 
TGGAGATATTATGGAGATAGCTGCTATGGGTTCTTCAGGCACAACTTAACATGGGAAGAGAG 
TAAGCAGTACTGCACTGACATGAATGCTACTCTCCTGAAGATTGACAACCGGAACATTGTGG 
AGTACATCAAAGCCAGGACTCATTTAATTCGTTGGGTCGGATTATCTCGCCAGAAGTCGAAT 
GAGGTCTGGAAGTGGGAGGATGGCTCGGTTATCTCAGAAAATATGTTTGAGTTTTTGGAAGA 
TGGAAAAGGAAATATGAATTGTGCTTATTTTCATAATGGGAAAATGCACCCTACCTTCTGTG 
AGAACAAACATTATTTAATGTGTGAGAGGAAGGCTGGCATGACCAAGGTGGACCAACTACCT 
TAATGCAAAGAGGTGGACAGGATAACACAGATAAGGGCTTTATTGTACAATAAAAGATATGT 
ATGAATG CAT C AG TAG CTGAAAAAAAAAAAAAA 
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FIGURE 306 

MQDEDG Y I TLN I KTRKPAL VS VG PAS SS WWRVMAL I LL I LCVGMWGLVALG I WSVMQRN YL 
QDENENRTGTLQQIiAKRFCQYVVKQS EL KG TFKGHKCSPCDT3WRYYGDSCYGFFRHNLT WE 
ESKQYCTDMNATLLKIDNRNIVEYIKARTHLIRWVGLSRQKSNEVWKWEDGSVISENMFEFL 
EDGKGNMNCAYFHNGKMHPTFCENKHYLMCERKAGMTKVDQLP 
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